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DESCRIPTION 



PESTICIDALLY ACTIVE PROTEINS AND POLYNUCLEOTIDES 
OBTAINABLE FROM PAENIBA CILL US SPECIES 

Cross-Reference to Related Applications 
[oooi] This application claims priority to provisional application Serial No. 60/392,633, filed 

June 28, 2002, and to provisional application Serial No. 60/441,647, filed January 21, 2003. 



Background of the Invention 

[0002] Insects and other pests cost fanners billions of dollars annually in crop losses and in the 

expense of keeping these pests under control. The losses caused by insect pests in agricultural 
production environments include decreases in crop yield, reduced crop quality, and increased 
harvesting costs. Insect pests are also a burden to vegetable and fruit growers, to producers of 
ornamental flowers, and to home gardeners and homeowners. 

[0003] Cultivation methods, such as crop rotation and the application of high levels of nitrogen 

fertilizers, have partially addressed problems caused by agricultural pests. However, economic 
demands on the utilization of farmland restrict the use of crop rotation. In addition, 
overwintering traits of some insects are disrupting crop rotations in some areas. 

[0004] Thus, synthetic chemical insecticides are relied upon most heavily to achieve a sufficient 

level of control. However, the use of synthetic chemical insecticides can have several drawbacks. 
For example, the use of some of these chemicals can adversely affect many beneficial insects. 
Target insects have also developed resistance to some chemical pesticides. This has been 
partially alleviated by various resistance management strategies, but there is an increasing need 
for alternative pest control agents. Furthermore, very high populations of larvae, heavy rains, and 
improper calibration of insecticide application equipment can result in poor control. The 
improper use of insecticides raises environmental concerns such as contamination of soil and of 
both surface and underground water supplies. Residues can also remain on treated fruits, 
vegetables, and on other treated plants. Working with some insecticides can also pose hazards to 
the persons applying them. Therefore, synthetic chemical pesticides are being increasingly 
scrutinized for their potential toxic environmental consequences. Stringent new restrictions on 
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the use of pesticides and the elimination of some effective pesticides from the market place could 
limit economical and effective options for controlling damaging and costly pests. 
[ooos] Because of the problems associated with the use of synthetic chemical pesticides, there 

exists a clear need to limit the use of these agents and a need to identify alternative control 
agents. The replacement of synthetic chemical pesticides, or combination of these agents with 
biological pesticides, could Teduce the levels of toxic chemicals in the environment. 
[0006] Some biological pesticidal agents that are now being used with some success are derived 

from the soil microbe Bacillus thitringiensis (B.t). The soil microbe Bacillus thuringiensis (B.t.) 
is a Gram-positive, spbre-fbrnring bacterium. Most strains of At do not exhibit pesticidal 
activity. Some At strains produce, and can be characterized by, parasporal crystalline protein 
inclusions. These inclusions often appear microscopically as distinctively shaped crystals. Some 
B.t. proteins are highly toxic to pests, such as insects, and are specific in their toxic activity. 
Certain insecticidal At proteins are associated with the inclusions. These "5-endotoxins" are 
different from exotoxins, which have a non-specific host range. Other species of Bacillus also 
produce pesticidal proteins. 
[0007] Certain Bacillus toxin genes have been isolated and sequenced, and recombinant DNA- 

based products have been produced and approved for use. In addition, with the use of genetic 
engineering techniques, various approaches for delivering these toxins to agricultural 
environments are being perfected. These include the use of plants genetically engineered with 
toxin genes for insect resistance and the use of stabilized intact microbial cells as toxin delivery 
vehicles. Thus, isolated Bacillus toxin genes are becoming commercially valuable. 
[ooos] Commercial use of A t pesticides was initially restricted to targeting a narrow range of 

lepidopteran (caterpillar) pests. Preparations of the spores and crystals of A thuringiensis subsp. 
kurstaU have been used for many years as commercial insecticides for lepidopteran pests. For 
example, A thuringiensis var. kurstaki HD-1 produces a crystalline 8-endotoxin which is toxic to 
the larvae of a number of lepidopteran insects. 
[0009] More recently, new subspecies of At have been identified, and genes responsible for 

active 5-endotoxin proteins have been isolated. Hofte and Whiteley classified At crystal protein 
genes into four major classes (H6fte, H., H.R. Whiteley [1989] Microbiological Reviews 
52(2):242-255). The classes were Cryl (Lepidoptera-specific), CryJl (Lepidoptera- and Diptera- 
specific), CryTR (Coleoptera-specific), and CrylV (Diptera-specific). The discovery of strains 
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specifically toxic to other pests has been reported. For example, CryV and CryVl were proposed 
to designate a class of toxin genes that are nematode-specific. 

(ooioj The Lepidopteran-specific Cryl crystal proteins, in their natural state, are approximately 

130- to 140-kDa proteins, which accumulate in bipyramidal crystalline inclusions during the 
sporulation of B. thuringiensis. These proteins are protoxins which solubilize in the alkaline 
environment of the insect midgut and are proteolytically converted by crystal-associated or larval- 
midgut proteases into a toxic core fragment of 60 to 70 kDa. This activation can also be carried 
out in vitro with a variety of proteases. The toxic domain is localized in the N-terminal half of 
the protoxin. This was demonstrated for CrylAQb) and CrylC proteins through N-terminal amino 
acid sequencing of the trypsin-activated toxin. Hofte et al 1989. Cleavage occurs on the C- 
terminal end of a conserved region called "Block 5," thus forming the C-terminus of the core 
toxin. A short, N-teiminal protoxin segment can also be processed offi The N-terminal cleavage 
site is also highly conserved for CrylA and CryJD proteins, suggesting that for these proteins, the 
N terminus of the toxic fragment is localized at the same position. CryJB, however, is different 
from the other Cryl proteins in this region. It was not known whether this protein is also 
processed at the N terminus. Hofte et al 1989. 

[ooi i] Deletion analysis of several cryl genes further confirmed that the 3* half of the protoxin is 

not required for toxic activity. One of the shortest reported toxic fragments was localized 
between codons 29 and 607 for CrylAb. Further removal of four codons from the 3 ! end or eight 
codons from the 5' end completely abolished the toxic activity of the gene product. Similar 
observations were made for the crylA(a) and cry\A(c) genes. Hofte et al 1989. 

[0012] • The cryTL genes encode 65-kDa proteins which form cuboidal inclusions in strains of 
several subspecies. These crystal proteins were previously designated "P2" proteins, as opposed 
to the 130-kDa PI crystal proteins present in the same strains. Hofte et al 1989. 

[ooi3] A cryUA gene was cloned from B. thuringiensis subsp. kurstaki HD-263 and expressed in 

Bacillus megaterium. Cells producing the CryUA protein were toxic for the lepidopteran species 
Heliothis virescens and Lymantria dispar as well as for larvae of the dipteran Aedes aegypti. 
Widner and Whitely (1989, Bacteriol 171:965-974) cloned two related genes (cryUA and 
cryUS) from B. thuringiensis subsp. kurstaki HD-1 . Both genes encode proteins of 633 amino 
acids with a predicted molecular mass of 71 kDa (slightly larger than the apparent molecular 
mass determined for the P2 proteins produced in B. tfmringiensis). Although the CryUA and 
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CrySB proteins are highly homologous (-87% amino acid identity), they differ in their 
insecticidal spectra. CryOA is active against both a lepidopteran (Manduca sexta) and a dipteran 
(Aedes aegypti) species, whereas cryHB is toxic only to the lepidopteran insect. Hofte et al 
1989. The CryU toxins, as a group, tend to be relatively more conserved at the sequence level 
(>80% identical) than other groups. In contrast, there are many Cryl toxins, for example, 
including some that are less than 60% identical. 

The 1989 nomenclature and classification scheme of Hofte and Whiteley for crystal 
proteins was based on both the deduced amino acid sequence and the host range of the toxin. 
That system was adapted to cover 14 different types of toxin genes which were divided into five 
major classes. The 1989 nomenclature scheme became unworkable as more and more genes 
were discovered that encoded proteins with varying spectrums of pesticidal activity. Thus, a 
revised nomenclature scheme was adopted, which is based solely on amino acid identity 
(Crickmore et ai, 1998, Microbiology and Molecular Biology Reviews 62:807-813). The 
mnemonic "cry" has been retained for all of the toxin genes except cytA and cytB, which remain 
a separate class. Roman numerals have been exchanged for Arabic numerals in the primary rank, 
and the parentheses in the tertiary rank have been removed. Many of the original names have 
been retained, with the noted exceptions, although a number have been reclassified. There are 
now at least 37 primary classes of Cry proteins, and two primary classes of cyt toxins. Other 
types of toxins, such as those of WO 98/18932 and WO 97/40162, have also been discovered 
from B. thuringiensis. 

There are some obstacles to the successful agricultural use of Bacillus (and other 
biological) pesticidal proteins. Certain insects can be refractory to the effects of Bacillus toxins. 
Insects such as boll weevils, black cutworm, and Helicoverpa zea, as well as adult insects of most 
species, heretofore have demonstrated no significant sensitivity to many B.t 5-endotoxins. 

Another potential obstacle is the development ofresistance to B.t toxins by insects. B.t. 
protein toxins were initially formulated as sprayable insect control agents. A more recent 
application of B.t technology has been to isolate and transform plants with genes that encode 
these toxins. Transgenic plants subsequently produce the toxins, thereby providing insect control. 
SeeU.S. Patent Nos. 5,380,831; 5,567,600; and 5,567,862 to Mycogen Corporation. Transgenic 
B. t. plants are quite efficacious, and usage is predicted to be high in some crops and areas. This 
has caused some concern that resistance management issues may arise more quickly than with 
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traditional sprayable applications. While a number of insects have been selected for resistance to 
B.t. toxins in the laboratory, only me damondback moth (/>/ M te// a ^,/ oste // a ) has demonstrated 
resistance in a field setting (Ferre, J. and Van Rie, J., Amu. Rev. Entomol. 47:501-533, 2002). 
[oo,7] Resistance management strategies inB.t. transgene plant technology have become of great 

interest (for example, as in a natural bacterium, multiple diverse toxins can be exposed on the 
sameplant, thereby greatly reducing the chance that an insect that might be resistant to one toxin 
wouldsurviveto spread the resistance). Severalstategiesluivebeensuggestedforpreservingthe 
abihtytoeffectivelyusei*. thuringiensis toxins. These strategies include high dose with refuge, 
and alternation with, or co-deployment of, different toxins (McGaughey et al. (1998), "B.t. 
Resistance Management," Nature Biotechnol 16:144-146). 
mis] Thus, there remains a great need for developing additional genes that can be expressed in 

plants in order to effectively control various insects, m addition to continuaUy trying to discover 
new*./, toxins, it would be quite desirable to discover other bacterial sources (distinct from B. t.) 
that produce toxins that could be used in transgenic plant strategies, or that could be combined 
with B.t.s to produce insect-controlling transgenic plants. 
WW ™ er ^ nte ff°^toclonemsecti^^ 

group of bacteria present potential alternatives to toxins derived from B. thuringiensis. It has 
been known in the art that bacteria of the genus Xenorhabdus are symbiotically associated with 
the Steinernema nematode. Unfortunately, as reported in a number of articles, the bacteria only 
had pesticidal activity when injected into insect larvae and did not exhibit biological activity 
when delivered orally. 
(mo] IthMbe endifficmttoeffi 

bacterial symbiont. Thus, it would be quite desirable to discover proteinaceous agents from 
Xenorhabdus bacteria that have oral activity so that the products produced therefrom could be 
formulated as a sprayable insecticide, or the bacterial genes encoding said proteinaceous agents 
could be isolated and used in the production oftransgenic plants. WO 95/00647 relates to the use 
of Xenorhabdus protein toxin to control insects, but it does not recognize orally active toxins. 
WO 98/08388 relates to orally aciministered pesticidal agents from Xenorhabdus. U.S. Patent 
No. 6,048,838 relates to protein toxins/toxin complexes, having oral activity, obtainable from 
Xenorliabdus species and strains. 
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(0021] Photorhabdus and Xenorliabdus spp. are Gram-negative bacteria that 

entomopathogenically and symbiotically associate with soil nematodes. These bacteria are found 
in the gut of entomopathogenic nematodes that invade and kill insects. When the nematode 
invades an insect host, the bacteria are released into the insect haemocoel (the open circulatory 
system), and both the bacteria and the nematode undergo multiple rounds of replication; the 
insect host typically dies. These bacteria can be cultured away from their nematode hosts. For a 
more detailed discussion of these bacteria, see Forst and Nealson, 60 Microbiol. Rev. 1 (1996), 
pp. 21-43. 

[0022] The genus Xenorhabdus is taxonomically defined as a member of the Family 

Enterobacteriaceae, although it has certain traits atypical of this family. For example, strains of 
this genus are typically nitrate reduction negative and catalase negative. Xenorhabdus has only 
recently been subdivided to create a second genus, Photorhabdus, which is comprised of the 
single species Photorliabdus luminescens (previously Xenorhabdus luminescens) (Boemare et aL, 
1993 Int. J. Syst. Bacterial. 43, 249-255). This differentiation is based on several distinguishing 
characteristics easily identifiable by the skilled artisan. These differences include the following: 
DNA-DNA characterization studies; phenotypic presence (Photorhabdus) or absence 
(Xenorhabdus) of catalase activity; presence (Photorhabdus) or absence (Xenorhabdus) of 
bioluminescence; the Family of the nematode host in that Xenorhabdus is found in 
Steinemematidae and Photorhabdus is found in Heterorhabditidae); as well as comparative, 
cellular fatty-acid analyses (Janse et aL 1990, Lett Appl. Microbiol. 10, 131-135; Suzuki et aL 
1990, /. Gen. Appl. Microbiol., 36, 393-401). In addition, recent molecular studies focused on 
sequence (Rainey et al. 1 995, Int. J. Syst. Bacteriol., 45, 379-381) and restriction analysis (Brunei 
et aL, 1997, App. Environ. Micro., 63, 574-580) of 16S rRNA genes also support the separation 
of these two genera. 

[0023] The expected traits for Xenorhabdus are the following: Gram stain negative rods, white to 

yellow/brown colony pigmentation, presence of inclusion bodies, absence of catalase, inability to 
reduce nitrate, absence of bioluminescence, ability to uptake dye from medium, positive gelatin 
hydrolysis, growth on Enterobacteriaceae selective media, growth temperature below 37° C, 
survival under anaerobic conditions, and motility. 

[0024] Currently, the bacterial genus Xenorhabdus is comprised of four recognized species, 

Xenorhabdus nematophilus, Xenorhabdus poinarii, Xenorhabdus bovienii and Xenorhabdus 
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beddings (Brunei et al., 1997, App. Environ. Micro., 63, 574-580). A variety of related strains 
have been described in the literature (eg., Akhurst and Boemare 1988 J. Gen. Microbiol., 134, 
1835-1845; Boemare et al. 1993 Int. J. Syst. Bacteriol. 43, pp. 249-255; Putz et al. 1990, Appl. 
Environ. Microbiol, 56, 181-186, Brunei etal, 1997 App. Environ. Micro., 63, 574-580, Rainey 
etal. 1995, Int. J. Syst. Bacteriol., 45, 379-381). 
[0025] Xenorhabdus and Photorhabdus bacteria secrete a wide variety of substances into the 

culture medium; these secretions include lipases, proteases, antibiotics and lipopolysaccharides. 
Purification of differentprotease fiactions has clearly demonstrated that they are not involved in 
the oral toxic activity of P. luminesce™ culture medium (which has been subsequently 
determined to reside with the Tc proteins only). "Several of these substances have previously been 
implicated in insect toxicity but until recently no insecticidal genes had been cloned. However, 
protease purification and separation will also fecilitate an examination of their putative role in, 
for example, inhibiting antibacterial proteins such as cecropin. R.H. ffrench-Constant and 
Bowen, Current Opinions in Micriobiology, 1999, 12:284-288. SeeR.H. ffrench-Constant etal. 
66 AEM No. 8, pp. 3310-3329 (Aug. 2000), for a review of various factors involved in 
Photorhabdus virulence of insects. 
[0026] There has been substantial progress in the cloning of genes encoding insecticidal toxins 

from bom Photorhabdus luminescens saAXenorhabdus nematophilus. Toxin-complex encoding 
genes from P. luminescens were examined first. See, e.g., WO 98/08932. "Parallel" genes were 
more recendy cloned from X. nematophilus. Morgan et al, Applied and Environmental 
Microbiology 2001, 67:2062-69. 
[0027] Four different toxin complexes (TCs)— Tea, Tcb, Tec and Ted— have been identified in 

Photorhabdus spp. Each of these toxin complexes resolves as either a single or dimeric species 
on a native agarose gel but resolution on a denaturing gel reveals that each complex consists of a 
range of species between 25-280 kDa. The ORFs that encode the TCs from Photorhabdus, 
togemerwimproteasecleavagesites(verticalarrows),arefllustratedmFlgurel. SeealsoRSL ■ 
ffrench-Constant and Bowen, 57 Cell. Mol. Life Sci. 828-833 (2000). 
[0028] Genomic libraries of P. luminescens were screened with DNA probes and with 

monoclonal and/or polyclonal antibodies raised against the toxins. Four tc loci were cloned: tea, 
tcb, tec and ted. The tea locus is a putative operon of three open reading frames (ORFs), tcaA, 
tcaB, and tcaC transcribed from the same DNA strand, with a smaller terminal ORF (tcaZ) 
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transcribed in the opposite direction. The tec locus also is comprised of three ORFs putatively 
transcribed in the same direction (tccA, tccB, and tccQ. The tcb locus is a single large ORF 
(tcbA), and the ted locus is composed of two ORFs (tcdA and tcdB); tcbA and tcdA, each about 
7.5 kb, encode large insect toxins. TcdB has some homology to TcaC. Many of these gene 
products were determined to be cleaved by proteases. For example, both TcbA and TcdA are 
cleaved into three fragments termed i, ii and iii (eg. TcbAi, TcbAii and TcbAiii). Products of the 
tea and tec ORFs are also cleaved. See Figure 1. See also R.H. ffrench-Constant and D.J. 
Bowen, Current Opinions in Microbiology, 1999, 12:284-288. 
[0029] Bioassays of the Tea toxin complexes revealed them to be highly toxic to first instar 

tomato hornworms (Manduca sexta) when given orally (LD50 of 875 ng per square centimeter of 
artificial diet). R.H. ffrench-Constant and Bowen 1999. Feeding was inhibited at Tea doses as 
low as 40 ng/cm 2 . Given the high predicted molecular weight of Tea, on a molar basis, P. 
Iwninescens toxins are highly active and relatively few molecules appear to be necessary to exert 
a toxic effect. R.H. ffrench-Constant and Bowen, Current Opinions in Micriobiology, 1999, 
12:284-288. 

[0030] None of the four loci showed overall similarity to any sequences of known function in 

GenBank. Regions of sequence similarity raised some suggestion that these proteins (TcaC and 
TccA) may overcome insect immunity by attacking insect hemocytes. R.H. ffrench-Constant and 
Bowen, Current Opinions in Microbiology, 1999, 12:284-288. 

[0031 ] TcaB, TcbA, and TcdA all show amino acid conservation (-50% identity), compared with 

each other, immediately around their predicted protease cleavage sites. This conservation 
between three different TC proteins suggests that they may all be processed by the same or 
similar proteases. TcbA and TcdA also share -50% identity overall, as well as a similar 
predicted pattern of both carboxy- and amino-terminal cleavage. It was postulated that these 
proteins might thus be homologs of one another. Furthermore, the similar, large size of TcbA 
and TcdA, and also the fact that both toxins appear to act on the gut of the insect, may suggest 
similar modes of action. R.H. ffrench-Constant and Bowen, Current Opinions in Microbiology, 
1999, 12:284-288. 

[0032] Deletion/knock-out studies suggest that products of the tea and ted loci account for the 

majority of oral toxicity to lepidopterans. Deletion of either of the tea or ted genes greatly 
reduced oral activity against Manduca sexta. That is, products of the tea and ted loci are oral 
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lepidopteran toxins on their own; their combined effect contributed most of the secreted oral 
activity. RJL ffrench-Constant and DJ. Bowen, 57 Cell. Mol. Life. Sci. 831 (2000). 
Interestingly, deletion of eilher of the tcb or tec loci alone also reduces mortality, suggesting that 
there maybe complex interactions among the different gene products. Thus, products of the tea 
locus may enhance the toxicity of taf products. Alternatively, ted products may modulate the 
toxicity of tea products and possibly other complexes. Noting that the above relates to oral 
activity against a single insect species, tcb or tec loci may produce toxins that are more active 
against other groups of insects (or active via injection directly into the insect haemocoel— the 
normal route of delivery when secreted by the bacteria in vivo). R.H. ffiench-Constant and 
Bowen, Current Opinions in Microbiology, 1999, 12:284-288. 
[0033] WO 0 1/1 1 029 discloses nucleotide sequences that encode TcdA and TcbA and have base 

compositions that have been altered from that of the native genes to make them more similar to 
plant genes. Also disclosed are transgenic plants that express Toxin A and Toxin B. 
[0034] Of the separate toxins isolated from Photorhabdus luminesce™ (W-14), those designated 

Toxin A and Toxin B have been the subject of focused investigation for their activity against 
target insect species of interest (e.g., corn rootworm). Toxin A is comprised of two different 
subunits. The native gene tcdA encodes protoxin TcdA. As determined by mass spectrometry, 
TcdA is processed by one or more proteases to provide Toxin A. More specifically, TcdA is an 
approximately 282.9 kDa protein (25 1 6 aa) that is processed to provide TcdAi (the first 88 amino 
acids), TcdAii (the next 1849 aa; an approximately 208.2 kDa protein encoded by nucleotides 
265-5811 of tcdA), and TcdAiii, an approximately 63.5 kDa (579 aa) protein (encoded by 
nucleotides 5812-7551 oftedA). TcdAii and TcdAiii appear to assemble into a dimer (perhaps 
aided by TcdAi), and the dimers assemble into a tetramer of four (timers. Toxin B is similarly 
derived from TcbA. 

[0035] While the exact molecular interactions of the TC proteins with each other, and their 

mechanism(s)of action, are not currently understood, it is known, for example, thatthe Tea toxin 
complex of Photorhabdus is toxic to Manama sexta. In addition, some TC proteins are known 
to have "stand alone" insecticidal activity, while other TC proteins are known to potentiate or 
enhance the activity of the stand-alone toxins. It is known that the TcdA protein is active, alone, 
against Manduca sexto, but that TcdB and TccC, together, can be used to enhance the activity of 
TcdA. Waterfield, N. et al, Appl. Environ. Microbiol. 2001, 67:5017-5024. TcbA (there is 
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only one Tcb protein) is another stand-alone toxin from Photorhabdus. The activity of this toxin 
(TcbA) can also be enhanced by TcdB together with TccC-like proteins. 

U.S. Patent Application 20020078478 provides nucleotide sequences for two potentiator 
genes, tcdB2 and tccC2, from the ted genomic region of Photorhabdus luminescens W-14. It is 
shown therein that coexpression of tcdB and tccCl with tcdA results in enhanced levels of oral 
insect toxicity compared to that obtained when tcdA is expressed alone. Coexpression of tcdB 
and tccCl with tcdA or tcb A provide enhanced oral insect activity. 

As indicated in the chart below, Tec A has some level of homology with the N terminus of 
TcdA, and TccB has some level of homology with the C terminus of TcdA. TccA and TccB are 
much less active on certain test insects than is TcdA. TccA and TccB from Photorhabdus strain 
W-14 are called "Toxin D." 'Toxin A" (TcdA), "Toxin B" (TcbA), and 'Toxin C" (TcaA and 
TcaB) are also indicated below. Furthermore, TcaA has some level of homology with TccA and 
likewise with the N terminus of TcdA. Still further, TcaB has some level of homology with 
TccB and likewise with the N terminus of TcdA. TccA and TcaA are of a similar size, as are 
TccB and TcaB. TcdB has a significant level of similarity (both in sequence and size) to TcaC. 
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The insect midgut epithelium contains both columnar (structural) and goblet (secretory) 
cells. Ingestion of tea products by M sexta leads to apical swelling and blebbing of large 
cytoplasmic vesicles by the columnar cells, leading to the eventual extrusion of cell nuclei in 
vesicles into the gut lumen. Goblet cells are also apparently affected in the same fashion. 
Products of tea act on the insect midgut following either oral delivery or injection. R.H. flrench- 
Constant and D J. Bowen, Current Opinions in Microbiology, 1999, 12:284-288. Purified tea 
products have shown oral toxicity against Manduca sexta (LD 5 o of 875 ng/cm 2 ). R.H. ffrench- 
Constant and D.J. Bowen, 57 Cell Mol Life Sci. 828-833 (2000). 
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[0039} WO 99/42589 and U.S. PatentNo. 6,281,413 disclose TC-likeORFs from Photorhabdus 

luminescens. WO 00/30453 and WO 00/42855 disclose TC-like proteins from Xenorhabdus. 
WO 99/03328 and WO 99/54472 (and U.S. Patent Nos. 6,174,860 and 6,277,823) relate to other 
toxins from Xenorhabdus and Photorhabdus. 

[0040] Relatively recent cloning efforts in Xenorhabdus nematophilus also appear to have 

identified novel insecticidal toxin genes with homology to the P. luminescens tc loci. See, e.g, 
WO 98/08388 and Morgan et al, Applied and Environmental Microbiology 2001 , 67:2062-69. 
In R.H. flrench-Constant and D.J. Bowen, Current Opinions in Microbiology, 1999, 12:284-288, 
cosmid clones were screened directly for oral toxicity to another lepidopteran, Pieris brassicae. 
One orally toxic cosmid clone was sequenced. Analysis of the sequence in that cosmid suggested 
that there are five different ORF's with similarity to Photorhabdus tc genes; or/2 and or/5 both 
have some level of sequence relatedness to both tcbA and tcdA, whereas orfl is similar to tccB, 
or/3 is similar to tccC and orf4 is similar to tcaC. Importantly, a number of these predicted ORFs 
also share the putative cleavage site documented in P. luminescens, suggesting that active toxins 
may also be protealytically processed. 

[0041] There are five typical Xenorhabdus TC proteins: XptAl, XptA2, XptBl, XptCl, and 

XptDl . XptAl is a "stand-alone" toxin. XptA2 is another TC protein from Xenorhabdus that 
has stand-alone toxin activity. See GENBANK Accession No. AJ308438 for sequences from 
Xenorhabdus nematophilus. XptBl and XptCl are the Xenorhabdus potentiators that can 
enhance the activity of either (or both) of the XptA toxins. XptDl has some level of homology 
with TccB. XptCl has some level of similarity to TcaC. The XptA2 protein ofXenorhabdus has 
some degree of similarity to the TcdA protein. XptBl has some level of similarity to TccC. 

[0042] The finding of somewhat similar, toxin-encoding loci in these two different bacteria is 

interesting in terms of the possible origins of these virulence genes. The X nematophilus cosmid 
also appears to contain transposase-like sequences whose presence may suggest that these loci 
can be transferred horizontally between different strains or species of bacteria. A range of such 
transfer events may also explain the apparently different genomic organization of the tc operons 
in the two different bacteria. Further, only a subset of X nematophilus and P. luminescens strains 
appear toxic to M sexta, suggesting either that different strains lack the tc genes or that they carry 
a different tc gene compliment Detailed analysis of both a strain and toxin phytogeny within, 
and between, these bacterial species should help clarify the likely origin of the toxin genes and 
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how they are maintained in different bacterial populations. R.H. f&ench-Constant and Bowen, 
Current Opinions in Microbiology, 1999, 12:284-288. 

TC proteins and genes have more recently been described from other insect-associated 
bacteria such as Serratia entomophila, an insect pathogen. Waterfield et al, TRENDS in 
Microbiology, Vol. 9, No. 4, April 2001. 

In summary, toxin complex proteins fromP. luminescens and* nematophilus appear to 
have little homology to previously identified bacterial toxins and should provide useful 
alternatives to toxins derived from*, thuringiensis. Although they have similar toxic effects on 
the insect midgut to other orally active toxins, their precise mode of action remains obscure. 
Future work could clarify their mechanism of action. 

Although some Xenorhabdus TC proteins were found to "correspond" (have a similar 
function and some level of sequence homology) to some of the Photorhabdus TC proteins, a 
pveaPhotorhabdus protein shares only about 40% sequence identity with the "corresponding" 
Xenorhabdus protein. This is illustrated below for four "stand-alone" toxins: 



[0046J 





Identity to P.l.W-14TcbA 


Identity to P.l. W-14 TcdA 


Xwi XptAl 


44% 


46% 


XwiXptA2 


41% 


41% 



(Foramorecompletereview, 5e e, e.g., Morgan^/., "Sequence Analysis of Insecticidal Genes 
ftomXenorhabdus nematodes PMFT296 " Vol. 61,Applied and Environmental Microbiology, 

May 2001, pp. 2062-2069.) 

Bacteria of the genus Paenibacillus are distinguishable from other bacteriaby distinctive 
rRNA and phenotypic characteristics (C. Ash et al. (1993), "Molecular identification of rRNA 
group 3 bacilli (Ash, Farrow, Wallbanks and Collins) using a PCR probe test: Proposal forthe 
creation of anew genus Paenibacillus," Antonie Van Leeuwenhoek 64:253-260). Comparative 
16S rRNA sequence analysis demonstrated that the genus Bacillus consisted of at least five 
phyletic lines. Ribosomal RNA group 3 bacilli (of Ash, Farrow, Wallbanks, and Collins (1991), 
comprismg2te«7/usjH>/y^ 

subtilis (the type species of the genus and other aerobic, endospore-forming bacilli) that they 
were reclassified as a new genus, Paenibacillus. 
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[mi] Some species in this genus were known to be pathogenic to honeybees {Paenibacillus 

Larvae) and scarab beetle grubs (P. popilliae and P. lentimorbus). Some other Paenibacillus 
species that have been found to be associated with honeybees, but they are non-pathogens. At 
least 18 additional species are known in this genus, including P. thiaminolyticus; they have no 
known insect association (Shida era/., 1997; Pettersson etal, 1999). Scarabs (coleopterans) are 
senous pests of turf, nurseries, and food crops throughout North America, and are of quarantine 
concern. See U.S. Department of Agriculture, Agricultural Research Service website. 
10048) P. larvae, P. popilliae, and P. lentimorbus are considered obligate insect pathogens 

involved with milky disease of scarab beetles (D.P. Stanly el al. (1992), 'The genus Bacillus: 
insect pathogens," p. 1697-1745, In A. Balows et al, ed., The Procaryotes, 2 nd Ed., Vol. 2, 
Springer-Verlag, New York, NY). These three Paenibacillus species are characteristically slow- 
growing, fastidious organisms that cause disease by an invasive process in which the bacteria 
cross the midgut and proliferate to high numbers in the hemolymph and other tissues. For all 
three species, some general indications of protein involvement in insect pathogenicity have been 
proposed; however, no specific role for a specific protein has been demonstrated. Stahly et al. 
concluded for P. larvae that a question of the involvement of a toxin is an open one, and that the 
precise cause of death in milky disease (of beetles) is not understood. 
[0049] A beetle (coleopteran) toxin, Cry 1 8, has been identified in stains of P. popilliae and P. 

lentimorbus. Cry 1 8 has about 40% identity to Cry2 proteins (Zhang et al, 1 997; Harrison et al, 
2000). While Zhang et al. (1997) speculate that CryU attacks the midgut to facilitate entry of 
vegetative cells to the hemocoel, Harrison et al. note that there is no direct evidence for this role 
and further state that "the role, if any, of the paraspore protein in milky disease is unknown." J. 
Zhang et al. (1997), "Cloning and Analysis of the First cry Gene from Bacillus popilliae;' J. 
Bacteriol. 179:4336-4341; H. Harrison et al. (2000), "Paenibacillus Associated with Milky 
Disease in Central and South American Scarabs," J. Invertebr. Pathol. 76(3): 169-1 75. 
[ooso] Stahlyera/.,Zhangera/.,andHarrisoneia/. all point to the contrast in evidence for the 

role of crystal proteins of B. thuringiensis in intoxication of insects (where the high frequency of 
insect symptoms can be explained by the properties of the specific crystal proteins), versus the 
case of Paenibacillus and milky disease (where there is no such tie to the effects of a specific 
toxin). 
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Thus, while some species of Paenibacillus were known to be pathogenic to certain 
" coleopterans and some associated with honeybees, no strain of Paenibacillus was heretofore 
known to be toxic to lepidopterans. Likewise, TC proteins and lepidopteran-toxic Cry proteins 
have never been reported in Paenibacillus. 

Brief Summary of the Invention 

[0052} This is the first known disclosure of Paenibacillus protein toxins having activity against 

lepidopteran pests. Some species of Pamibacillus were known to be insecticidal, but they had 
activity against grubsfteetles/coleopterans. There have been no known reports of a Paenibacillus 
species or strain having toxicity to lepidopterans. Thus, the subject invention relates generally to 
Paenibacillus species that have activity against lepidopterans, and to screening Paenibacillus 
spp., proteins therefrom, and libraries of clones therefrom for activity against lepidopterans. 

[0053] More specifically, the subject invention initially stemmed from a discovery of a novel 

strain of Paenibacillus referred to herein as DAS 1529. This was a surprising discovery for a 
variety of reasons. This strain produces a unique, lepidopteran-toxic Cry protein. This strain, as 
well as DB482, produce unique, toxin complex (TC)-iike proteins (having some similarity to 
Xenorhabdus/Photorhabdus TCs). Paenibacillus isolate DB482 and toxins obtainable therfefrom 
are highly preferred, and all are within the scope of the subject invention. 

[0054] This is the first known report of Paenibacillus having TC-Iike proteins. Thus, the subject 

invention relates to methods of screening Paenibacillus spp. for TC-like genes and proteins. 
Paenibacillus TC proteins of the subject invention are shown herein to be useful to enhance or 
potentiate the activity of a "stand-alone" Xenorhabdus toxin protein, for example. TC-like genes 
identified herein were not heretofore known to exist in the genus Paenibacillus. This discovery 
broadens the scope of organisms (bacterial genera) in which TC-like genes have been found. 
Thus, the subject invention generally relates to TC-like proteins obtainable from Paenibacillus 
species, to methods of screening Paenibacillus species for such proteins, and the like. One 
example is Paenibacillus apairius, which was also found to produce TC-like proteins. 

[0055] While the subject TC-like proteins have some sequence relatedness to, and characteristics 

in common with, TC proteins ofXenorJiabdus mdPhotorabdus, the sequences of the subject TC- 
like proteins are very different from previously known TC proteins. Thus, the subject application 
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provides new classes of TC-like proteins and genes that encode these proteins, which are 
obtainable from bacteria in the genera Paenibacillus, Photorhabdus, Xenorliabdus, and the like. 
[0056] Another surprising feature of the DAS 1 529 strain is that it produces a unique, B.tAike 

Cry protein that is toxic to lepidopterans. The subject Cry toxin is compressed/short and appears 
to lack a typical protoxin portion in its wild-type state. Thus, the subject invention generally 
relates to screening Paenibacillus isolates for lepidopteran-toxic Cry proteins. The subject 
invention also relates to methods of screening Paenibacillus spp. and B. tlwringiensis, for 
example, for this new class of Cry genes and proteins. 
[0057] The DAS 1 529 strain is the first known example of a natural bacterium that produces both 

a Cry-like toxin and TC-like proteins. Further surprising is that this is the first known example of 
a cry toxin gene being closely associated with (in genetic proximity to) TC protein genes. These 
pioneering observations have broad implications and thus enable one skilled in the art to screen 
appropriate species of bacteria for these types of unique operons and for these types of farther 
components of known operons. Such techniques are within the scope of the subject invention. 
[0058] A further aspect of the subject invention stems from the surprising discovery that the 

DAS 1529 strain also produces a soluble insect toxin that was found to be very similar to a 
thiaminase. It was surprising that the Paenibacillus thiaminase protein was found to have 
insecticidal activity. While this type of protein was known, it was in no way expected in the art 
that this enzyme would have exhibited toxin-like activity against insects/insect-like pests. Thus, 
the subject invention also relates to methods of screening Paenibacillus and others for 
insecticidal thiaminase genes and proteins, and to the use of these genes and proteins for 
controlling insects and like pests. 
[0059] Other objects, advantages, and features of the subject invention will be apparent to one 

skilled in the art having the benefit of the subject disclosure. 

Brief Description of the Figures 
[0060] Figure 1 shows the TC operons from Photorhabdus. 

[mi] Figure 2 shows a diagram of the DNA from DAS1 529 inserted into the "SB12" clone 

that exhibited pesticidal activity, with open reading frames identified with block and line arrows. 

[0062] Figure 3 shows partial sequence alignments for SEQ ID NO: 1 7 and thiaminase I from 

Bacillus thiaminolyticus (Campobasso et al $ 1998) or AAC44156. 
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[0063] Figure 4 shows test results of purified thiaminase from DAS1529 on CEW. 

[0064] Figure 5 shows ORF3-ORF6 in pEtlOlD. 

. Figure 6 shows Cry\ 529 (ORF 7) against tobacco bud worm (TBW). 

Figure 7 shows a comparison/alignment of SEQ ID NO:9 to SEQ ID NO:5 (tcaB 2 to 
tcaBi); the brackets show the ORF2 junction. 

Figure 8 shows aphylogenetic tree of DAS 1529 ORF7 (C/yl529) compared to other Cry 



[0065] 
[0066] 



[0067] 



[0068] 



proteins. 

Figures 9 and 10 show results of trypsin digestion of wild-type and modified Cryl529 



proteins. 

[0069] Figures 11A and 11B show sequence alignments for tcaA primer design. 

[0070] Figures 12A-D show sequence alignments for tcaB primer design. 

Figures 13A and 13B show sequence alignments for tcaC primer design: 
Figures 14A and 14B show sequence alignments for tccC primer design. 



[0071] 
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[0077] 
[0078] 

[0079] 
[0080] 

[0081] 
[0082] 



Brief Description of the Sequences 
SEQ ID NO:l is the nucleic acid sequence of the entire insert of SB12. 
SEQ ID NO:2 is the nucleic acid sequence of ORF1, which encodes a tcaA-like protein 
(gene tcaAl, source organism Paenibacillus strain IDAS 1529, gene designation tcaAl-1529). 
SEQ ID NO:3 is the amino acid sequence encoded by ORF1 . 

SEQ ID NO:4 is the nucleic acid sequence of ORF2, with an IS element removed, which 
encodes a tcaB-like protein (gene tcaBi, source organism Paenibacillus strain IDAS 1529, gene 
designation tcaBl-1529\ 

SEQ ID NO:5 is the amino acid sequence encoded by ORF2. 

SEQ ID NO:6 is the nucleic acid sequence of ORF3, which encodes a tcaA-like protein 
(gene tcaA2, source organism Paenibacillus strain IDAS 1529, gene designation tcaA2-1529). 

SEQ ID NO:7 is the amino acid sequence encoded by ORF3. 

SEQ ID NO:8 is the nucleic acid sequence of ORF4, which encodes a tcaB-like protein 
(gene tcaB2, source organism Paenibacillus strain IDAS 1529, gene designation tcaB2-l529\ 

SEQ ID NO:9 is the amino acid sequence encoded by ORF4. 

SEQ ID NO:10 is the nucleic acid sequence of ORF5, which encodes a tcaC-like protein 
(gene tcaC, source organism Paenibacillus strain IDAS 1529, gene designation tcaC-l529\ 
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[0083] SEQ ID NO:ll is the amino acid sequence encoded by ORF5. 

[0084] SEQ ID NO: 12 is the nucleic acid sequence of ORF6, which encodes a tccC-Hke protein. 

[0085] SEQ ID NO:13 is the amino acid sequence encoded by ORF6. 

[0086] SEQ ID NO:14 is the nucleic acid sequence of ORF7, which encodes a Cry-like protein. 

[0087] SEQ ID NO:15 is the amino acid sequence encoded by ORF7. 

[0088] SEQ ID NO:16 is the partial nucleic acid sequence of the 16S rDNA of DAS 1529 used 

for taxonomic placement. 
[0089] SEQ ID NO: 17 is the N-terminal amino acid sequence for the purified toxin from the 

broth fraction from DAS 1 529. 
[0090] SEQ ID NO:l 8 is the amino acid sequence of thiaminase I framBacillus thiaminolyticus 

(Campobasso etaU J- Biochem. 37(45):15981-15989 (1998)). 
[0091] SEQ ID NO:19 is an alternate amino acid sequence encoded by ORF6 protein (gene 

tccC, source organism Paenibacillus strain IDAS 1529, gene designation tccC-1529). 
[0092] SEQ ID NO:20 is gene xptCl, source organism Xenorhabdus strain Xwi, gene 

designation xptC 1 -Xwi. 

[0093] SEQ ID NO:21 is gene xptBl, source organism Xenorhabdus strain Xwi, gene 

designation xptBl-Xwi. 
[0094] SEQ ID NO:22 is primer SB101. 

[0095] SEQ ID NO:23 is primer SB102. 

[0096] SEQ ID NO:24 is primer SB103. 

[0097] SEQ ID NO:25 is primer SB 1 04. 

[0098] SEQ ID NO:26 is primer SB105. 

[om] SEQ ID NO:27 is primer SB106. 

[oowo] SEQ ID NO:28 is primer SB212. 

[ooioi] SEQ ID NO:29 is primer SB213. 

[0OW2] SEQ ID NO:30 is primer SB21 5. 

[ooi03] SEQ ID NO:31 is primer SB217. 

[ooio4] SEQ ID NO:32 is a nucleotide sequence from a tcaA-like gene from Paenibacillus 

apairius strain DB482. 

[ooios] SEQ ID NO:33 is an amino acid sequence from a TcaA-like protein from Paenibacillus 

apairius strain DB482. 
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[oow6] SEQ ID NO:34 is a nucleotide sequence from a tcaB-like gene from Paenibacillus 

apairius strain DB482. 

[oow7j SEQ ID NO:35 is a nucleotide sequence from a tcaB-like gene from Paenibacillus 

apairius strain DB482. 

[ooios] SEQ ID NO:36 is an amino acid sequence from a TcaB-like protein from Paenibacillus 

apairius strain DB482. 

[oow9] SEQ ID NO:37 is an amino acid sequence from a TcaB-like protein from Paenibacillus 

apairius strain DB482. 

[oouo] SEQ ID NO:38 is a nucleotide sequence from a tcaC-like gene from Paenibacillus 

apairius strain DB482. 

[ooni] SEQ ID NO:39 is an amino acid sequence from a TcaC-like protein from Paenibacillus 

apairius strain DB482. 

[ooii2] SEQ ID NO:40 is a nucleotide sequence from a tccC-like gene from Paenibacillus 

apairius strain DB482. 

[ooii3] SEQ ID NO:41 is an amino acid sequence from a TccC-like protein from Paenibacillus 

apairius strain DB482. 

[ooiN] SEQ ID NO:42 is gene tcdBl, source organism Photorhabdus strain W14, gene 

designation tcdBl-W14. 

[ooiis] SEQ ID NO:43 is gene tcdB2, source organism Photorhabdus strain W14, gene 

designation tcdB2-W14. 

[00116] SEQ ID NO:44 is gene tccCl, source organism Photorhabdus strain W14, gene 

designation tccCl-W14. 

[ooii7] SEQ ID NO:45 is gene tccC2, source organism Photorhabdus strain W14, gene 

designation tccC2-W14. 

[ooiis] SEQ ID NO:46 is gene tccC3, source organism Photorhabdus strain W14, gene 

designation tccC3-W14. 

[ooii9] SEQ ID NO:47 is gene tccC4, source organism Photorhabdus strain W14, gene 

designation tccC4-W14. 

[00120] SEQ ED NO:48 is gene tccCS, source organism Phtorhabdus strain W14, gene 

designation tccC5~Wl 4. 
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[00121] SEQ ID NO:49 is the amino acid sequence of the XptA2 TC protein from Xenorhabdus 

nematophilus Xwi. 

Detailed Description of the Invention 
[00122] The subject invention provides unique biological alternatives for pest control. More 

specifically, the subject invention provides new sources of proteins that have toxin activity 
against insects, preferably lepidopterans, and other similar pests. The invention also relates to 
new sources of novel polynucleotides that can be used to encode such toxins, and to methods of 
making and methods of using the toxins and corresponding nucleic acid sequences to control 
insects and other like plant pests. The present invention addresses the need for novel insect 
control agents. The present invention relates to novel pesticidal proteins that are obtainable from 
Paenibacillus, and other, bacteria. 
[00123] The subject invention initially stemmed from a discovery of a novel strain of 

Paenibacillus. This strain is referred to herein as DAS 1529. To demonstrate the broad 
implications of this discovery, the discovery of another Paenibacillus strain is also exemplified. 
These strains have been deposited with the Agricultural Research Service Patent Culture 
Collection (NRRL) at 1815 North University Street Peoria, 111. 61604 U.SA. The deposited 
strains and the corresponding deposit dates and deposit numbers are as follows: 

Deposited Strain Deposit Date Deposit Number 

DAS 1 529 June 1 9, 2002 NRRL B-30599 

DB482 June 1 7, 2003 NRRL B-30670 

[00124] These cultures have been deposited for the purposes of this patent application and were 

deposited under conditions that assure that access to the cultures is available during the pendency 
of this patent application to one determined by the Commissioner of Patents and Trademarks to 
be entitled thereto under 37 CFR 1.14 and 35 U.S.C. 122. These deposits will be available as 
required by foreign patent laws in countries wherein counterparts of the subject application, or its 
progeny, are filed. However, it should be understood that the availability of a deposit does not 
constitute a license to practice the subject invention in derogation of patent rights granted by 
governmental action. 

[ooi25] Further, the subject culture deposits were made in accordance with the provisions of the 

Budapest Treaty for the Deposit of Microorganisms, i.e. 9 they will be stored with all the care 
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necessary to keep them viable and uncontaminated for a period of at least five years after the 
most recent request for the furnishing of a sample of the deposit, and in any case, for a period of 
at least thirty (30) years after the date of deposit or for the enforceable life of any patent which 
may issue disclosing the culture. The depositor acknowledges the duty to replace the deposit 
should the depository be unable to furnish a sample when requested, due to the condition of the 
deposit All restrictions on the availability to the public of the subject culture deposits will be 
irrevocably removed upon the granting of a patent disclosing them. 

The discovery of the subject DAS 1 529 strain was surprising for a variety of reasons. This 
strain produces a unique, lepidopteran-toxic Cry protein. This strain, as well as DB482, also 
produce unique, toxin complex (TQ-like proteins (having some similarity to 
Xenorhabdus/Photorhabdus TCs). Paenibacillus isolate DB482 and toxins obtainable therefrom 
are highly preferred, and all are within the scope of the subject invention. 

This is the first known disclosure of a Paenibacillus protein toxin having activity against 
a lepidopteran pest. The DAS 1 529 strain was found to have toxin activity against lepidopteran 
pests. This was a surprising discovery. Some species of Paenibacillus were known to have 
insecticidal activity against grubs/beetles/coleopterans. There have been no known reports of a 
Paenibacillus species or strain having toxicity to lepidopterans. Thus, the subject invention 
relates generally to Paenibacillus species that have activity against lepidopterans, and to 
screening Paenibacillus cultures, proteins therefrom, and libraries of clones therefrom, for 
activity against lepidopterans, and/or for genes that encode "lep toxins," and more particularly, 
for lepidopteran-toxic Cry proteins. 

This is also the first known report of Paenibacillus having TC-like proteins. Thus, the 
subject invention relates to methods of screening Paenibacillus spp. for TC-like genes and 
proteins. It was very surprising to find that the DAS1529 and DB482 strains have TC-like 
operons and produce TC proteins (having some level of similarity to TC proteins of Xenorhabdus 
and Photorhabdus). TC proteins and genes identified herein were not heretofore known to exist 
in the genus Paenibacillus. This discovery broadens the scope of organisms (bacterial genera) in 
which TC protein genes have been found. Thus, the subject invention generally relates to TC 
proteins obtainable from Paenibacillus species, to methods of screemngPaefiibacillus species for 
such proteins, and the like. An example of a Paenibacillus species found using the methods of 
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the subject invention is Paenibacillus apairius strain DB482. This P. apairius strain also 
produces unique TC-like proteins. 

[00129] While the subject TC proteins have some characteristics in common with TC proteins of 

Xenorhabdus and Photorabdus, the subject TC proteins are unique and different from previously 
known TC proteins. Thus, the subject application provides new classes of TC-like proteins and 
genes that encode these proteins obtainable from bacteria in the genera Paenibacillus, 
Photorhabdus, Xenorhabdus, Serratia, and the like. 

[00130] The subject invention also relates to lepidopteran-toxic Cry proteins that are obtainable 

from Paenibacillus species. Thus, the subject invention relates to methods of screening 
Paenibacillus species for cry genes and Cry proteins that have toxin activity against a 
lepidopteran pest. 

[ooni] The DAS1 529 Cry toxin is a very unique, B.t. -like Cry protein toxin. One other strain of 

Paenibacillus, a strain with activity against grubs, was known to produce a coleopteran-toxic Cry 
protein. That was a Cry 1 8 protein, which was most related to Cryl proteins (but only about 40% 
identity). The Cry protein exemplified herein shows only a low level of sequence identity and 
similarity to previously known Cry proteins. With that noted, of all the known B. t. Cry proteins, 
the subject Cry protein shares the most similarity to Cryl proteins. One surprising aspect of the 
subject Cry protein is that it is very short, even shorter than the Cry lFa core toxin. The 
subject Cry protein has an identifiable Block 5 region at or near its C terminus. This toxin in its 
wild-type state has no protoxin portion, which is typically found on Cryl toxins. The subject Cry 
toxin is surprisingly compressed. Thus, the subject invention generally relates to a new class of 
Cry proteins. This disclosure is also significant to the search for additional cry genes from 
Bacillus thuringiensis (B.t). As would be clear to one skilled in the art having the benefit of the 
subject disclosure, other bacteria, such as B.t. and other Bacillus spp. (including sphaericus) 
could be screened for similar toxins and toxin genes. These methods of screening are within the 
scope of the subject invention. 

[00132] The DAS 1 529 strain is the first known example of a natural bacterium that produces both 

a Cry-like toxin and TC-like proteins. Further surprising is that this is the first known example of 
a cry toxin gene being closely associated with (in genetic proximity to) TC protein genes. These 
pioneering observations thus enable one skilled in the art to screen appropriate species ofbacteria 
for these types of unique operons and for these types of further components of known operons. 
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Such techniques are within the scope of the subject invention. The DAS 1529 strain is an 
interesting example of a wild type strain having a TC-like operon with multiple TC protein genes 
of the same general type (i.e., in this case, two tcaA-like and two tcaB-like genes). This could 
have implications for further gene discovery. 

[ooi 33] A further aspect of the subject invention stems from the surprising discovery that the 

Paetiibacillus thiaminase protein has insecticidal activity. While this protein was known, it was 
in no way expected in the art that this enzyme would have exhibited toxin-like activity against 
insects/insect-like pests. 

[ooi34] Paenibacillus TC proteins 

[ooi35] More specifically regarding the exemplified TC proteins, the following TC proteins from 

strain DAS 1529 have been folly characterized herein: two TcaA-like proteins (TcaAi and 
TcaA 2 ), two TcaB-like proteins (TcaBi and TcaB 2 ), a TcaC protein, and a TccC-like protein. The 
TcaAi and TcaA 2 proteins are highly similar to each other at the sequence level, and the tcaBi 
and tcaB 2 proteins are highly similar to each other at the sequence level. TC-like proteins 
obtainable from Paenibacillus apairius are also exemplified herein, and are within the scope of 
the subject invention. 

[ooi36] The TC proteins of the subject invention can be used like other TC proteins. This would 

be readily apparent to one skilled in the art having the benefit of the subject disclosure when 
viewed in light of what was known in the art. See, e.g., the Background section, above, which 
discusses RJL ffrench-Constant and Bowen (2000) and U.S. Patent No. 6,048,838. For example, 
it was known that the Tea toxin complex of Photorhabdus is highly toxic to Manduca sexta. 

[00137] While the exact molecular interactions of the TC proteins with each other, and their 

mechanism(s) of action, are not currently understood, some TC proteins were known to have 
"stand alone" insecticidal activity, and other TC proteins were known to enhance the activity of 
the stand-alone toxins produced by the same given organism. For example, it was known that the 
TcdA protein was active against Maiiduca sexta. TcaC and TccC, together, can be used to 
enhance the activity of TcdA. TcdB can be used (in place of TcaC) with TccC as a potentiator. 
TcbA is another Photorhabdus TC protein with stand-alone toxin activity. TcaC (or TcdB) 
together with TccC can also be used to enhance/potentiate the toxin activity of TcbA. 

[ooi 38] Photorhabdus TC proteins and "corresponding" TC proteins/genes from Paenibacillus 

are summarized below. 
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Photorhabdus 








strain W14 


Photorhabdus 


Paenibacillus 


Photorhabdus 


nomenclature 


Self homology 
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TcaA 


Toxin C 


TccA 


ORF3 (& 1) 


TcaB 


TccB 


ORF4(&2) 


TcaC 




TcdB 


ORF5 


Tcb 


Toxin B 






TccA 


Toxin D 


TcdA N terminus 




TccB 


TcdA C terminus 




TccC 






ORF6 


TcdA 


Toxin A 


TccA + TccB 




TcdB 




TcaC 





loom) As indicated above, TccA has some level of homology with the N terminus of TcdA, and 

TccB has some level of homology with the C terminus of TcdA. Furthermore, TcdA is about 280 
kDa, and TccA together with TccB are of about the same size, if combined, as TcdA. 
Furthermore, TcaA has some level of homology with TccA and likewise with the N terminus of 
TcdA. Still further, TcaB has some level of homology with TccB and likewise with the N 
terminus of TcdA. TccA and TcaA are of a similar size, as are TccB and TcaB. 

100140) Although some Xenorhabdus TC proteins were found to "correspond" to some of the 

Photorhabdus TC proteins, the "corresponding" proteins share only about 40% (approximately) 
sequence identity with each other. The subject TC proteins from Paenibacillus have about that 
same degree of sequence relatedness (-40% identity) with prior TC proteins. 

[ooi4i) As described in more detail below, one or more toxins of the subject invention can be 

used in combination with each other and/or with other toxins (z.e., the Photorhabdus Tea 
complex was known to be active against Manduca sexta; various "combinations" of 
Photorhabdus TC proteins, for example, can be used together to enhance the activity of other, 
stand-alone Photorhabdus toxins; the use of Photorhabdus toxins "with" B.l toxins, for example, 
has been proposed for resistance management.) Furthermore, Paenibacillus TC proteins of the 
subject invention are shown herein to be useful to enhance or potentiate the activity of a "stand- 
alone" Xenorhabdus toxin protein, for example. Provisional application No. 60/441,723 
(Timothy D. Hey et a/.), entitled "Mixing and Matching TC Proteins for Pest Control," relates to 
the surprising discovery that a TC protein derived from an organism of one genus such as 
Photorhabdus can be used interchangeably with a "corresponding" TC protein derived from an 
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organism of another genus. Further surprising data along these lines is presented below which 
further illustrate the utility of the Paeiiibacillus TC proteins of the subject invention. One reason 
that these results might be surprising is that there is only -40% sequence identity between 
"corresponding" Xhenorhabdus, Photorhabdus, and the subject Paeiiibacillus TC proteins. 
[m42) Proteins and toxins . The present invention provides easily administered, functional 

proteins. The present invention also provides a method fbT delivering insecticidal toxins that are 
functionally active and effective against many orders of insects, preferably lepidopteran insects. 
By "functional activity" (or "active against") it is meant herein that the protein toxins function as 
orally active insect control agents (alone or in combination with otherproteins), thatthe proteins 
have a toxic effect (alone or in combination with other proteins), or are able to disrupt or deter 
insect growth and/or feeding which may or may not cause death of the insect. When an insect 
comes into contact with an effective amount of a "toxin" of the subject invention delivered via 
transgenic plant expression, formulated protein composition(s), sprayable protein compositions), 
a bait matrix or other delivery system, the results are typically death of the insect, inhibition of 
the growth and/or proliferation of the insect, and/or prevention of the insects from feeding upon 
the source (preferably a transgenic plant) that makes the toxins available to the. insects. 
Functional proteins of the subject invention can also enhance or improve the activity of other 
toxin proteins. Thus, terms suchas "toxic," "toxicity," "toxin activity," and "pesticidally active" 
as used herein are meant to convey that the subject "toxins" have "functional activity" as defined 
herein. 

[ooi43] Complete lethality to feeding insects is preferred, butis not required to achieve functional 

activity. If an insect avoids the toxin or ceases feeding, that avoidance will be useful in some 
applications, even if the effects are sublethal or lethality is delayed or indirect For example, if 
insect resistant transgenic plants are desired, the reluctance of insects to feed on the plants is as 
useful as lethal toxicity to the insects because the ultimate objective is avoiding insect-induced 
plant damage. 

[00144] There are many other Ways in which toxins can be incorporated into an insect's diet For 

example, it is possible to adulterate the larval food source with the toxic protein by spraying the 
food with a protein solution, as disclosed herein. Alternatively, the purified protein could be 
genetically engineered into an otherwise harmless bacterium, which could then be grown in 
culture, and either applied to the food source or allowed to reside in the soil in an area in which 
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insect eradication was desirable. Also, the protein could be genetically engineered directly into 
an insect food source. For instance, the major food source for many insect larvae is plant 
material. Therefore the genes encoding toxins can be transferred to plant material so that said 
plant material expresses the toxin of interest. 

fooj45j Transfer of the functional activity to plant or bacterial systems typically requires nucleic 

acid sequences, encoding the amino acid sequences for the toxins, integrated into a protein 
expression vector appropriate to the host in which the vector will reside. One way to obtain a 
nucleic acid sequence encoding a protein with functional activity is to isolate the native genetic 
material from the bacterial species which produce the toxins, using information deduced from the 
toxin's amino acid sequence, as disclosed herein. The native sequences can be optimized for 
expression in plants, for example, as discussed in more detail below. Optimized polynucleotide 
can also be designed based on the protein sequence. 

[00146] The subject invention provides new classes of toxins having advantageous pesticidal 

activities. One way to characterize these classes of toxins and the polynucleotides that encode 
them is by defining a polynucleotide by its ability to hybridize, under a range of specified 
conditions, with an exemplified nucleotide sequence (the complement thereof and/or a probe or 
probes derived from either strand) and/or by their ability to be amplified by PCR using primers 
derived from the exemplified sequences. 

[00147] There are a number of methods for obtaining the pesticidal toxins of the instant invention. 

For example, antibodies to the pesticidal toxins disclosed and claimed herein can be used to 
identify and isolate other toxins from a mixture of proteins. Specifically, antibodies may be 
raised to the portions of the toxins which are most constant and most distinct from other toxins. 
These antibodies can then be used to specifically identify equivalent toxins with the characteristic 
activity by immunoprecipitation, enzyme linked immunosorbent assay (EOSA), or western 
blotting. Antibodies to the toxins disclosed herein, or to equivalent toxins, or to fragments of 
these toxins, can be readily prepared using standard procedures. Monoclonal, polyclonal, 
specific, and/or cross-reactive antibodies can be made and used according to the subject 
invention. Such antibodies can be included in test kits for detecting the presence of proteins (and 
antigenic fragments thereof) of the subject invention. 

[00148] One skilled in the art would readily recognize that toxins (and genes) of the subject 

invention can be obtained from a variety of sources. A toxin "from" or "obtainable from" the 
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subject DAS 1 529 isolate and/or the P. apiarius isolate means that the toxin (or a similar toxin) 
can be obtained from this isolate or some other source, such as another bacterial strain or a 
transgenic plant For example, one skilled in the art will readily recognize that, given the 
disclosure of a bacterial gene and toxin, a plant can be engineered to produce the toxin. Antibody 
preparations, nucleic acid probes (DNA and RNA), and the like may be prepared using the 
polynucleotide and/or amino acid sequences disclosed herein and used to screen and recover 
other toxin genes from other (natural) sources. Toxins of the subject invention can be obtained 
from a variety of sources/source microorganisms. 
f00I49 j Polynucleotides and probes . Thesubjectinventionfurflierprovidesnucleotidesequences 

that encode the toxins of the subject invention. The subject invention further provides methods 
of identifying and characterizing genes that encode pesticidal toxins. In one embodiment, the 
subject invention provides unique nucleotide sequences that are useful as hybridization probes 
and/or primers for PCR techniques. The primers produce characteristic gene fragments that can 
be used in the identification, characterization, and/or isolation of specific toxin genes. The 
nucleotide sequences of the subject invention encode toxins that are distinct from previously 
described toxins. 

[ooi so] The polynucleotides of the subject invention can be used to form complete "genes" to 

encode proteins or peptides in a desired host cell. For example, as the skilled artisan would 
readily recognize, the subject polynucleotides can be appropriately placed under the control of a 
promoter in a host of interest, as is readily known in the art. 

[ooisi] As the skilled artisan knows, DNA typically exists in a double-stranded form. In this 

arrangement, one strand is complementary to the other strand and vice versa. As DNA is 
replicated in a plant (for example), additional complementary strands of DNA are produced. The 
"coding strand" is often used in the art to refer to the strand that binds with the anti-sense strand. 
The mRNA is transcribed from the "anti-sense" strand of DNA. The "sense" or "coding" strand 
has a series of codons (a codon is three nucleotides that can be read as a three-residue unit to 
specify a particular amino acid) that can be read as an open reading frame (ORF) to form a 
protein or peptide of interest. In order to express a protein in vivo, a strand of DNA is typically 
transcribed into a complementary strand of mRNA which is used as the template for the protein. 
Thus, the subject invention includes the use of the exemplified polynucleotides shown in the 
attached sequence listing and/or equivalents including the complementary strands. RNA and 



i 



WO 2004/002223 



PCT/US2003/020082 



27 

PNA (peptide nucleic acids) that are functionally equivalent to the exemplified DNA are included 
in the subject invention. 

too/52] In one embodiment of the subject invention, bacterial isolates can be cultivated under 

conditions resulting in high multiplication of the microbe. After treating the microbe to provide 
single-stranded genomic nucleic acid, the DNA can be contacted with the primers of the 
invention and subjected to PCR amplification. Characteristic fragments of toxin-encoding genes 
will be amplified by the procedure, thus identifying the presence of the toxin-encoding gene(s). 

[ooi53] Further aspects of the subject invention include genes and isolates identified using the 

methods and nucleotide sequences disclosed herein. The genes thus identified encode toxins 
active against pests. 

[00154] Toxins and genes of the subject invention can be identified and obtained by using 

oligonucleotide probes, for example. These probes are detectable nucleotide sequences which 
may be detectable by virtue of an appropriate label or may be made inherently fluorescent as 
described in International Application No. WO/93/1 6094. The probes (and the polynucleotides 
of the subject invention) may be DNA, RNA, or PNA. In addition to adenine (A), cytosine (C), 
guanine (G), thymine (T), and uracil (U; for RNA molecules), synthetic probes (and 
polynucleotides) of the subject invention can also have inosine (a neutral base capable of pairing 
with all four bases; sometimes used in place of a mixture of all four bases in synthetic probes). 
Thus, where a synthetic, degenerate oligonucleotide is referred to herein, and "n" is used 
generically, "n" can be G, A, T, C, or inosine. Ambiguity codes as used herein are in accordance 
with standard IUPAC naming conventions as of the filing of the subject application (for example, 
R means A or G, Y means C or T, etc.). 

[00155] As is well known in the art, if a probe molecule hybridizes with a nucleic acid sample , it 

can be reasonably assumed that the probe and sample have substantial 
homology/similarity/identity. Preferably, hybridization of the polynucleotide is first conducted 
followed by washes under conditions of low, moderate, or high stringency by techniques well- 
known in the art, as described in, for example, Keller, G.H., M.M. Manak (1987) DNA Probes, 
Stockton Press, New York, NY, pp. 169-170. For example, as stated therein, low stringency 
conditions can be achieved by first washing with 2x SSC (Standard Saline Citrate)/0.1% SDS 
(Sodium Dodecyl Sulfete) for 15 minutes at room temperature. Two washes are typically 
performed. Higher stringency can then be achieved by lowering the salt concentration and/or by 
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raising the temperature. For example, the wash described above can be followed by two 
washings with O.lx SSC/0.1% SDS for 15 minutes each at room temperature followed by 
subsequentwasheswith0.1xSSC/0.1%SDSfor30minuteseachat55° C. These temperatures 
can be used with other hybridization and wash protocols set forth herein and as would be known 
to one skilled in the art (SSPE can be used as the salt instead of SSC, for example). The 2x 
SSC/0.1% SDS can be prepared by adding 50 ml of 20x SSC and 5 ml of 10% SDS to 445 ml of 
water. 20x SSC can be prepared by combining NaCl (175.3 g/0.150 M), sodium citrate (88.2 
g/0.015 M), and water to 1 liter, followed by adjusting pH to 7.0 with 10 N NaOH. 10% SDS 
can be prepared by dissolving 1 0 g of SDS in 50 ml of autoclaved water, diluting to 100 ml, and 
aliquotting. 

[00156] Detection of the probe provides a means for determining in a known manner whether 

hybridization has been maintained. Such aprobe analysis provides a rapid method for identifying 
toxin-encoding genes of the subject invention. The nucleotide segments which are used as 
probes according to the invention can be synthesized using a DNA synthesizer and standard 
procedures . These nucleotide sequences can also be used as PCR primers to amplify genes of the 
subject invention. 

loom] Probes for use according to the subject invention can be derived fiom a variety of sources, 

such as any gene mentioned or suggested herein. For example, all or part of any of the following 
types of genes (coding and/or noncoding or complementary strands thereof) can be used 
according to fee subject invention: tcaA, tcaB, tcaC, tcbA, tccA, tccB, tccC, tcdA, tcdB, xptAl, 
xptDl, xptBl, xptCl, xptA2, sepA, sepB, and sepC. Unless specifically indicated otherwise, 
reference to a "tccC" gene, for example, includes all specific alleles (such as tccCl and tccC2) of 
this type of gene. The same is true for all the other genes (eg., tcdB2, tccC3, and the alleles 
mentioned in Table 17). 

[00158] Hybridization characteristics of a molecule can be used to define polynucleotides of the 

subject invention. Thus the subject invention includes polynucleotides (and/or their 
complements, preferably their foil complements) that hybridize with a polynucleotide (or an 
oligonucleotide or primer) exemplified or suggested herein. 

[ooi59] As used herein "stringent" conditions for hybridization refers to conditions which achieve 

the same, or about the same, degree of specificity of hybridization as fee conditions employed by 
the current applicants. Specifically, hybridization of immobilized DNA on Southern blots with 
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32 P-labeled gene-specific probes was performed by standard methods (see, e.g. , Maniatis, T., E.F. 
Fritsch, J. Sambrook [1982] Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY). In general, hybridization and subsequent washes were 
carried out under conditions that allowed for detection of target sequences. For double-stranded 
DNA gene probes, hybridization was carried out overnight at 20-25° C below the melting 
" temperature (Tm) of the DNA hybrid in 6x SSPE, SxDenhardfs solution, 0.1% SDS, 0.1 mg/ml 
denatured DNA. The melting temperature is described by the following formula (Beltz, G.A., 
K A. Jacobs, T.H. Eickbush, P.T. Cherbas, and F.C. Kafatos [1 983] Methods ofEnzymology, R. 
Wu, L. Grossman and K. Moldave [eds.] Academic Press, New York 100:266-285): 

Tm = 81.5° C + 16.6 Log[Na+] + 0.41(%G-fC) - 0.61(%formamide) - 600/length of 
duplex in base pairs. 
[00J6OJ Washes are typically carried out as follows: 

(1) Twice at room temperature for 1 5 minutes in lx SSPE, 0.1% SDS (low stringency 

wash). 

(2) Once at Tm-20 °C for 1 5 minutes in 0.2x SSPE, 0.1% SDS (moderate stringency 

wash). 

[oo j ei] For oligonucleotide probes, hybridization was carried out overnight at 1 0-20 ° C below the 

melting temperature (Tm) of the hybrid in 6x SSPE, 5x Denhardt's solution, 0.1% SDS, 0.1 
mg/ml denatured DNA. Tm for oligonucleotide probes was determined by the following 
formula: 

Tm (° C) = 2(number T/A base pairs) + 4(number G/C base pairs) 
(Suggs, S.V., T. Miyake, E.H. Kawashime, MJ. Johnson, K. Itakura, and R.B. Wallace [1981] 
ICN-UCLA Symp. Dev. Biol Using Purified Genes, D.D. Brown [ed.], Academic Press, New 
York, 23:683-693). 
[ooi62] Washes were typically carried out as follows: 

(1) Twice at room temperature for 15 minutes lx SSPE, 0.1% SDS (low stringency 

wash). 

(2) Once at the hybridization temperature for 15 minutes in lx SSPE, 0.1% SDS 
(moderate stringency wash). 

[ooi63] In general, salt and/or temperature can be altered to change stringency. With a labeled 

DNA fragment >70 or so bases in length, the following conditions can be used: 
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[00164] 



[00165] 



Low: 1 or 2x SSPE, room temperature 

Low. 1 or 2x SSPE, 42° C 

Moderate: 0.2x or lx SSPE, 65° C 
High: 0.1xSSPE,65° C. 

Duplex formation and stability depend on substantial complementarity between the two 
strands of a hybrid, and, as noted above, a certain degree of mismatch can be tolerated. 
Therefore, the probe sequences of the subject invention include mutations (both single and 
multiple), deletions, insertions of the described sequences, and combinations thereof, wherein 
said mutations, insertions and deletions permit formation of stable hybrids with the target 
polynucleotide of interest. Mutations, insertions, and deletions can be produced in a given 
polynucleotide sequence in many ways, and these methods are known to an ordinarily skilled 
artisan. Other methods may become known in the future. 

VCR technology. Polymerase Chain Reaction (PCR) is a repetitive, enzymatic, primed 
synthesisofanucleicacidsequence. This procedure is well-known and commonly usedby those 
skilled in this art (see Mollis, U.S. Patent Nos. 4,683,195, 4,683,202, and 4,800,159; Saiki, 
Randall K., Stephen Scharf, Fred Faloona, Kary B. Mollis, Glenn T. Horn, Henry A. Erlich, 
Norman Arnheim [1985] "Enzymatic Amplification of 0-Globin Genomic Sequences and 
Restriction Site Analysis for Diagnosisof SickleCeUAnemk,"S C «e WC e230:1350-1354). PCRis 
based on the enzymatic amplification of a DNA fragment of interest that is flanked by two 
oUgonucleotideprimers thathybridize to opposite strands ofme target sequence. The primers are 
oriented with the 3 ' ends pointing towards each other. Repeated cycles of heat denaturation of 
' the template, annealing of the primers to their complementary sequences, and extension of the 
annealedprimers wim a DNApoly^^ 

the 5'endsofthePCRprimers. The extension product of each primer can serve as a template for 
the other primer, so each cycle essentially doubles the amount of DNA fragment produced in the 
previous cycle. This results in the exponential accumulation of the specific target fragment, up to 
several milUon-fold in a few hours. By using a thermostable DNA polymerase such as Tag 
polymerase, isolated from the thennophilic bacterium 77,™ aquaticus, the amplification 
process can be completely automated Other enzymes which can be used are known to those 
skilled in the art. 

[oo,66j The DNA sequences of the subject invention can be used as primers for PCR 
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amplification. In performing PCR amplification, a certain degree of mismatch can be tolerated 
between primer and template. Therefore, mutations, deletions, and insertions (especially 
additions of nucleotides to the 5' end) of the exemplified primers fall within the scope of the 
subject invention. Mutations, insertions, and deletions can be produced in a given primer by 
methods known to an ordinarily skilled artisan. 

(00167] Modification of genes and toxins . The genes and toxins useful according to the subject 

invention include not only the specifically exemplified full-length sequences, but also portions, 
segments and/or fragments (including internal and/or terminal deletions compared to the full- 
length molecules) of these sequences, variants, mutants, chimerics, and fusions thereof. For 
example, toxins of the subject invention may be used in the form of chimeric toxins produced by 
combining portions of two or more toxins/proteins. 

[00168] Proteins of the subject invention can have substituted amino acids so long as they retain 

the characteristic pesticidal/ functional activity of the proteins specifically exemplified herein. 
"Variant" genes have nucleotide sequences that encode the same toxins or equivalent toxins 
having pesticidal activity equivalent to an exemplified protein. The terms "variant proteins" and 
"equivalent toxins" refer to toxins having the same or essentially the same biological/functional 
activity against the target pests and equivalent sequences as the exemplified toxins. As used 
herein, reference to an "equivalent" sequence refers to sequences having amino acid 
substitutions, deletions, additions, or insertions which improve or do not adversely affect 
pesticidal activity. Fragments retaining pesticidal activity are also included in this definition. 
Fragments and other equivalents that retain the same or similar function, or "toxin activity," of a 
corresponding fragment of an exemplified toxin are within the scope of the subject invention. 
Changes, such as amino acid substitutions or additions, can be made for a variety of purposes, 
such as increasing (or decreasing) protease stability of the protein (without 
materially/substantially decreasing the functional activity of the toxin). 

[00169] Equivalent toxins and/or genes encoding these equivalent toxins can be obtained/derived 

from wild-type or recombinant bacteria and/or from other wild-type or recombinant organisms 
using the teachings provided herein. Other Bacillus, Paenibacillus, Pliotorhabdus, and 
Xenorhabdus species, for example, can be used as source isolates. 

[oono] Variations of genes may be readily constructed using standard techniques for making 

point mutations, for example. In addition, U.S. Patent No. 5,605,793, for example, describes 



WO 2004/002223 PCT/US2003/020082 



32 



methods for generating additional molecular diversity by using DNA reassembly after random 
fragmentation. Variant genes canbe used to produce variantproteins; recombinant hosts can be 
used to produce the variant proteins. Using these "gene shuffling" techniques, equivalent genes 
and proteins can be constructed that comprise any 5 . 1 0, or 20 contiguous residues (amino acid or 
nucleotide) of any sequence exemplified herein. As one skilled in the art knows, the gene 
shuffling techniques canbe adjusted to obtain equivalents having, for example, 1, 2, 3,4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 
34 35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59, 

6ol 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 
86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 
109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 
12s', 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142,143, 144, 145, 146, 
147 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 
166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 
185 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199,200,201,202,203, 
204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 
223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 
242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 
261, 262, 263, 264, 265, 266; 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 
280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 
299, 300, 301, 302, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, 313, 314, 315, 316, 317, 
3is', 319, 320, 321, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 
337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 
356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 
375', 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 
394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 
413', 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 
432! 433, 434, 435, 436, 437, 438, 439, 440, 441 , 442, 443, 444, 445, 446, 447, 448, 449, 450, 
451,452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 
470^471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 481, 482, 483, 484, 485, 486, 487, 488, 
489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, or 500 or more contiguous residues 
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(amino acid or nucleotide), corresponding to a segment (of the same size) in any of the 
exemplified sequences (or the complements (full complements) thereof). Similarly sized 
segments, especially those for conserved regions, can also be used as probes and/or primers. 

[ooi7i] Fragments of full-length genes can be made using commercially available exonucleases or 

endonucleases according to standard procedures. For example, enzymes such as BaB\ or site- 
directed mutagenesis can be used to systematically cut off nucleotides from the ends of these 
genes. Also, genes which encode active fragments maybe obtained using a variety of restriction 
enzymes. Proteases may be used to directly obtain active fragments of these toxins. 

[00172] It is within the scope of the invention as disclosed herein that toxins maybe truncated and 

still retain functional activity. By "truncated toxin" is meant that a portion of a toxin protein may 
be cleaved and yet still exhibit activity after cleavage. Cleavage can be achieved by proteases 
inside or outside of the insect gut. Furthermore, effectively cleaved proteins can be produced 
using molecular biology techniques wherein the DNA bases encoding said toxin are removed 
either through digestion with restriction endonucleases or other techniques available to the skilled 
artisan. After truncation, said proteins can be expressed in heterologous systems such as E. coli, 
baculoviruses, plant-based viral systems, yeast and the like and then placed in insect assays as 
disclosed herein to determine activity. It is well-known in the art that truncated toxins can be 
successfully produced so that they retain functional activity while having less than the entire, full- 
length sequence. It is well known in the art that B.t. toxins can be used in a truncated (core toxin) 
form. See,e.g., AdmgetaL, Gene 36:289-300 (1985), "Characterized full-length and truncated 
plasmid clones of the crystal protein of Bacillus thuringiensis subsp kurstaM HD-73 and their 
toxicity to Manduca sexto." There are other examples of truncated proteins that retain 
insecticidal activity, including the insect juvenile hormone esterase (U.S. Pat No. 5,674,485 to 
the Regents of the University of California). As used herein, the term "toxin" is also meant to 
include functionally active truncations. , On the other hand, a protoxin portion (typically the C- 
tenninal half of a typical B.t. Cry toxin) can be added to form an active, full-length protein. See, 
e.g., U.S. Patent No. 6,218,188. 

[oom] Certain toxins of the subject invention have been specifically exemplified herein. As 

these toxins are merely exemplary of the toxins of the subject invention, it should be readily 
apparent that the subject invention comprises variant or equivalent toxins (and nucleotide 
sequences coding for equivalent toxins) having the same or similar pesticidal activity of the 
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exemplified toxin. Equivalent toxins will have amino acid similarity (and/or homology) with an 
exemplified toxin. The amino acid identity will typically be greaterthan 60%, preferably greater 
than 75%, more preferably greater than 80%, even more preferably greater than 90%, and can be 
greater than 95%. Preferred polynucleotides and proteins of the subject invention can also be 
defined in terms of more particular identity and/or similarity ranges. For example, the identity 
and/or similarity can be 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 
68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 
94, 95, 96, 97, 98, or 99% as compared to a sequence exemplified herein. Unless otherwise 
specified, as used herein percent sequence identity and/or similarity of two nucleic acids is 
determined using the algorilhm of Karlin and Altschul (1990), Proc. Natl. Acad. Sci. USA 
87:2264-2268,modmedasmKarlmandAltschul(1993),iVoc.iVflrf./lcaJ. Sci. KM90:5873- 
5877. Such an algorithm is incorporated into the NBLAST and XBLAST programs of Altschul 
et al. (1990), J. Mol. Biol. 215:402-410. BLAST nucleotide searches are performed with the 
NBLAST program, score = 100, wordlength = 12. To obtain gapped alignments for comparison 
purposes, Gapped BLAST is used as described in Altschul et al. (1997), Nucl. Acids Res. 
25:3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of 
the respective programs (NBLAST and XBLAST) are used. See NCBI/NIH website. The scores 
can also be calculated using me methods and algorithms of Crickmore et al. as described in the 
Background section, above. 

[00174] The amino acid homology/similarity/identity will be highest in critical regions of the 

toxin which account for biological activity or are involved in the determination of three- 
dimensional configuration which is ultimately responsible for the biological activity. In this 
regard, certain amino acid substitutions are acceptable and can be expected to be tolerated. For 
example, these substitutions can be in regions of Ihe protein that are not critical to activity. 
Analyzing the crystal structure of a protein, and software-based protein structure modeling, can 
be used to identify regions of a protein that can be modified (using site-directed mutagenesis, 
shuffling, etc.) to actually change the properties and/or increase the functionality of the protem. 

loons] Various properties and targeted 3D features of the protein can also be changed without 

adversely affecting the toxin activity/functionality of the protein. Conservative amino acid 
substitutions can be expected to be tolerated/to not adversely affect the three-dimensional 
configuration of the molecule. Amino acids can be placed in the following classes: non-polar, 
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uncharged polar, basic, and acidic. Conservative substitutions whereby an amino acid of one 
class is replaced with another amino acid of the same type fall within the scope of the subject 
invention so long as the substitution is not adverse to the biological activity of the compound. 
Table 1 provides a listing of examples of amino acids belonging to each class. 



Table 1. 


Class of Amino Acid 


Examples of Amino Acids 


Nonpolar 


Ala, Val, Leu, lie, Pro, Met, Phe, Tip 


Uncharged Polar 


Gly, Ser, Thr, Cys, Tyr, Asn, Gin 


Acidic 


Asp, Glu 


Basic 


Lys, Arg, His 



loom] In some instances, non-conservative substitutions can also be made. The critical factor is 

that these substitutions must not significantly detract from the functional/biological activity of 
the toxin. 

[00177] As used herein, reference to "isolated" polynucleotides and/or purified" toxins refers to 

these molecules when they are not associated with the other molecules with which they would 
be found in nature. Thus, reference to "isolated" and/or "purified" signifies the involvement of 
the "hand of man" as described herein. For example, a bacterial toxin "gene" of the subject 
invention put into a plant for expression is an "isolated polynucleotide." Likewise, a 
Paenibacillus protein, exemplified herein, produced by a plant is an "isolated protein." 

[00178] Because of the degeneracy/redundancy of the genetic code, a variety of different DNA 

sequences can encode the amino acid sequences disclosed herein. It is well within the skill of a 
person trained in the art to create alternative DNA sequences that encode the same, or essentially 
the same, toxins. These variant DNA sequences are within the scope of the subject invention. 

f ooi 79] Optimization of sequence for expression in plants . To obtain high expression of 

heterologous genes in plants it may be preferred to reengineer said genes so that they are more 
efficiently expressed in (the cytoplasm of) plant cells. Maize is one such plant where it may be 
preferred to re-design the heterologous gene(s) prior to transformation to increase the expression 
level thereof in said plant. Therefore, an additional step in the design of genes encoding a 
bacterial toxin is reengineering of a heterologous gene for optimal expression. 
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One reason for the reengineering of a bacterial toxin for expression in maize is due to the 
non-optimal G+C content of the native gene. For example, the very low G+C content of many 
native bacterial gene(s) (and consequent skewing towards high A+T content) results in the 
generation of sequences mimicking or duplicating plant gene control sequences that are known 
to be highly A+T rich. The presence of some A+T-rich sequences within the DNA of gene(s) 
introduced into plants (e.g., TATA box regions normally found in gene promoters) may result in 
aberrant transcription of the gene(s). On the other hand, the presence of other regulatoiy 
sequences residing in the transcribed mRNA (eg., polyadenylation signal sequences 
(AAUAAA), or sequences complementary to small nuclear RNAs involved in pre-mRNA 
splicing) may lead to RNA instability. Therefore, one goal in the design of genes encoding a 
bacterial toxin for maize expression, more preferably referred to as plant optimized genets), is to 
generate a DNA sequence having a higher G+C content, and preferably one close to that of 
maize genes coding for metabolic enzymes. Another goal in the design of the plant optimized 
gene(s) encoding a bacterial toxin is to generate a DNA sequence in which the sequence 
modifications do not hinder translation. 

The table below (Table 2) illustrates how high the G+C content is in maize. For the data 
in Table 2, coding regions of the genes were extracted from GenBank (Release 71) entries, and 
base compositions were calculated using the MacVector™ program (Accelerys, Burlington, 
MA). Intron sequences were ignored in the calculations. 
[ooi82] Due to the plasticity afforded by the redundancy/degeneracy of the genetic code (i.e., 

some amino acids are specified by more than one codon), evolution of the genomes in different 
organisms or classes of organisms has resulted in differential usage of redundant codons. This 
"codon bias" is reflected in the mean base composition of protein coding regions. For example, 
organisms with relatively low G+C contents utilize codons having A or T in the third position of 
redundant codons, whereas those having higher G+C contents utilize codons having G or C in 
the third position. It is thought that the presence of "minor" codons within a mRNA may reduce 
the absolute translation rate of that mRNA, especially when the relative abundance of the 
charged tRNA corresponding to the minor codon is low. An extension of this is that the 
diminution of translation rate by individual minor codons would be at least additive for multiple 
minor codons. Therefore, mRNAs having high relative contents of minor codons would have 



[00181] 



WO 2004/002223 



PCT/US2003/020082 



37 

correspondingly low translation rates. This rate would be reflected by subsequent low levels of 
the encoded protein. 

[com] Inreengineering genes encoding a bacterial toxin formaize (or other plant, such as cotton 

or soybean) expression, the codon bias of the plant has been determined. The codon bias for 
maize is the statistical codon distribution that the plant uses for coding its proteins and the 
preferred codon usage is shown in Table 3. After determining the bias, the percent frequency of 
the codons in the gene(s) of interest is determined. The primary codons preferred by the plant 
should be determined as well as the second and third choice of preferred codons. Afterwards, the 
amino acid sequence of the bacterial toxin of interest is reverse translated so that the resulting 
nucleic acid sequence codes for exactly the same protein as the native gene wanting to be 
heterologously expressed. The new DNA sequence is designed using codon bias information so 
that it corresponds to the most preferred codons of the desired plant. The new sequence is then 
analyzed for restriction enzyme sites that might have been created by the modification. The 
identified sites are further modified by replacing the codons with second or third choice 
preferred codons. Other sites in the sequence which could affect transcription or translation of 
the gene of interest are the exonrintron junctions (5' or 3 T ), poly A addition signals, or RNA 
polymerase termination signals. The sequence is further analyzed and modified to reduce the 
frequency of TA or GC doublets. In addition to the doublets, G or C sequence blocks that have 
more than about four residues that are the same can affect transcription of the sequence. 
Therefore, these blocks are also modified by replacing the codons of first or second choice, etc. 
with the next preferred codon of choice. 



Table 2 


Compilation of G + C 


contents of protein coding 


regions of maize genes 


Protein Class.sup.a 


Range % G + C 


Mean % G + Csup.b 


Metabolic Enzymes (76) 


44.4-75.3 


59.0 (.+-.8.0) 


Structural Proteins (18) 


48.6-70.5 


63.6 (.+-.6.7) 


Regulatory Proteins (5) 


57.2-68.8 


62.0 (.+-.4.9) 


Uncharacterized Proteins (9) 


. 41.5-70.3 


64.3 (.+-.7.2) 


All Proteins (108) 


44.4-75.3 


60.8 (.+-.5.2) 



.sup.a Number of genes in class given in parentheses. 

.sup.b Standard deviations given in parentheses. 

.sup.c Combined groups mean ignored in mean calculation 



[00184} It is preferred that the plant optimized gene(s) encoding a bacterial toxin contain about 
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63% of first choice codons, between about 22% to about 37% second choice codons, and 
between about 1 5% to about 0% thiid choice codons, wherein the total percentage is 100%. Most 
preferred the plant optimized gene(s) contains about 63% of first choice codons, at least about 
22% second choice codons, about 7.5% third choice codons, and about 7.5% fourth choice 
codons, wherein the total percentage is 100%. The preferred codon usage for engineering genes 
for maize expression are shown in Table 3. The method described above enables one skilled in 
the art to modify gene(s) that are foreign to a particular plant so that the genes are optimally 
expressed in plants. The method is further illustrated in PCT application WO 97/13402. 

La order to design plant optimized genes encoding a bacterial toxin, the amino acid 
sequence of said protein is reverse translated into a DNA sequence utilizing a non-redundant 
genetic code established from a codon bias table compiled for the gene sequences for the 
particular plant, as shown in Table 2. The resulting DNA sequence, which is completely 
homogeneous in codon usage, is further modified to establish a DNA sequence that, besides 
having a higher degree of codon diversity, also contains strategically placed restriction enzyme 
recognition sites, desirable base composition, and a lack of sequences that might interfere with 
transcription of the gene, or translation of the product mRNA. 
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Table 3. 

Preferred amino acid codons for proteins 
expressed in maize 



Amino Acid 


Codon* 


Alanine 


GCC/GCG 


Cysteine 


TGC/TGT 


Aspartic Acid 


GAC/GAT 


Glutamic Acid 


GAG/GAA 


Phenylalanine 


TTC/TTT 


Glycine 


GGC/GGG 


Histidine 


CAC/CAT 


Isoleucine 


ATC/ATT 


Lysine 


AAG/AAA 


Leucine 


CTG/CTC 


Methionine 


ATG 


Asparagine 


AAC/AAT 


Proline 


CCG/CCA 


Glutamine 


CAG/CAA 


Arginine 


AGG/CGC 


Serine 


AGC/TCC 


Threonine 


ACC/ACG 


Valine 


GTG/GTC 


Tryptophan 


TGG 


Tiyrosine 


TAC/TAT 


Stop 


TGA/TAG 



*The first and second preferred codons for maize. 



[Mm] Thus, synthetic genes that are functionally equivalent to the toxins/genes of the subject 

invention can be used to transform hosts, including plants. Additional guidance regarding the 
production of synthetic genes can be found in, for example, U.S. Patent No. 5,380,831 . 

[ooi87] In some cases, especially for expression in plants, it can be advantageous to use truncated 

genes that express truncated proteins. Hofte et al 1989, for example, discussed in the 
Background Section above, discussed protoxin and core toxin segments of B.t. toxins. Preferred 
truncated genes will typically encode 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 
56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 
82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99% of the full-length toxin. 

I°° I88 J Transgenic hosts. The toxin-encoding genes of the subject invention can be introduced 

into a wide variety of microbial or plant hosts. In preferred embodiments, transgenic plant cells 
and plants are used. Preferred plants (and plant cells) are corn, maize, and cotton. * 

[00/89] In preferred embodiments, expression of the toxin gene results, directly or indirectly, in 
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the intracellular production (and maintenance) of the pesticide proteins. Plants can be rendered 
insect-resistant in this manner. When tramgemc/recrnnbrnant/transformed/transfected host cells 
(or contents thereof) are ingested by the pests, the pests will ingest the toxin. This is the 
preferred manner in which to cause'contact of the pest with the toxin. The result is control 
(killing or making sick) of the pest. Sucking pests can also be controlled in a similar manner. 
Alternatively, suitable microbial hosts, e.g., Pseudomonas such as P. fluorescein, canbe applied 
where target pests are present; the microbes can proliferate there, and are ingested by the target 
pests. The microbe hosting the toxin gene can be treated under conditions that prolong the 
activity of the toxin and stabilize the cell. The treated cell, which retains the toxic activity, can 
then be applied to the environment of the target pest. 
loom] Where the toxin gene is introduced via a suitable vector into a microbial host, and said 

host is applied to the environment in a living state, certain host microbes should be used. 
Microorganism hosts are selected which are known to occupy the "phytosphere" (phylloplane, 
phyllosphere, rbizosphere, and/or rhizoplane) of one or more crops of interest. These 
microorganisms are selected so as to be capable of successfully competing in the particular 
environment (crop and other insect habitats) with the wild-type microorganisms, provide for 
stable maintenance and expression of the gene expressing the polypeptide pesticide, and, 
desirably, provide for improved protection of the pesticide from environmental degradation and 
inactivation. 

loom] A large number of microorganisms are known to inhabit the phylloplane (the surface of 

the plant leaves) and/or the rhizosphere (the soil surrounding plant roots) of a wide variety of 
important crops. These microorganisms include bacteria, algae, and fungi. Of particular interest 
are microorganisms, such as bacteria, e.g., genera Pseudomonas, Erwinia, Serratia, Klebsiella, 
Xanthomonas,Streptomyces,Rlnzobium,Rhodopseufomon^ 

Acetobacter, Lactobacillus, Arthrobacter, Azotobacter, Leuconostoc, and Alcaligeiies; fungi, 
particularly yeast, eg., genera Saccharomyces, Cryptococcus,Kluyveromyces,Sporobolomyces, 
RJwdotorula, and Aureobasidium. Of particular interest are such phytosphere bacterial species as 
Pseudomonas syringae, Pseudomonas fluoresces, Serratia marcescens, Acetobacter xylinum, 
Agrobacterium tumefaciens, RJiodopseudomonas spheroides, Xanthomonas campestris, 
Rhizobium melioti, Alcaligenes entrophus, and Azotobacter vinlandii; and phytosphere yeast 
species such as Rhodotorula rubra, R. glutinis.R. marina, R aurantiaca, Cryptococcus albidus, 
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C. diffluens, C. laurentii, Saccharomyces rosei, S. pretoriensis, S. cerevisiae, Sporobolomyces 
roseus, 5. odorids, Kluyveromyces veronae, and Aureobasidium pollulans. Also of interest are 
pigmented microorganisms. 

[com] Insertion of genes to form transgenic hosts . One aspect of the subject invention is the 

transformation/transfection of plants, plant cells, and other host cells with polynucleotides of the 
subject invention that express proteins of the subject invention. Plants transformed in this 
manner can be rendered resistant to attack by the target pest(s). 

loom} A wide variety of methods are available for introducing a gene encoding a pesticidal 

protein into the target host under conditions that allow for stable maintenance and expression of 
the gene. These methods are well known to those skilled in the art and are described, for 
example, in United States Patent No. 5,135,867. 

[ooi94] For example, a large number of cloning vectors comprising a replication system in E. coli 

and a marker that permits selection of the transformed cells are available for preparation for the 
insertion of foreign genes into higher plants. The vectors comprise, for example, pBR322, pUC 
series, M13mp series, pACYC184, etc. Accordingly, the sequence encoding the toxin can be 
inserted into the vector at a suitable restriction site. The resulting plasmid is used for 
transformation into E. colt The E. coli cells are cultivated in a suitable nutrient medium, then 
harvested and lysed. The plasmid is recovered. Sequence analysis, restriction analysis, 
electrophoresis, and other biochemical-molecular biological methods are generally carried out as 
methods of analysis. After each manipulation, the DNA sequence used can becleavedand joined 
to the next DNA sequence. Each plasmid sequence can be cloned in the same or other plasmids. 
Depending on the method of inserting desired genes into the plant, other DNA sequences maybe 
necessary. It for example, the Ti or Ri plasmid is used for the transformation of the plant cell, 
then at least the right border, but often the right and the left border of the Ti or Ri plasmid T- 
DNA, has to be joined as the flanking region of the genes to be inserted. The use of T-DNA for 
the transformation of plant cells has been intensively researched and described in EP 120 516; 
Hoekema (1985) In: The Binary Plant Vector System, Ofifeet-durkkerij Kanters B.V., 
Alblasserdam, Chapter 5; Fraley et al , Crit. Rev. Plant Sci. 4: 1 -46; and An et al. (1 985) EMBO 
J. 4:277-287. 

[ooi95] A large number of techniques are available for inserting DNA into a plant host cell. 

Those techniques include transformation with T-DNA using Agrobacterium tumefaciens or 
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Agrobacterium rhizogenes as transformation agent, fusion, injection, biolistics (microparticle 
bombardment), or electroporation as well as omerpossiblemethods. K Agrobacteria are used for 
the transformation, the DNA to be inserted has to be cloned into special plasmids, namely either 
into an intermediate vector or into a binary vector. The intermediate vectors can be integrated 
into the Ti or Ri plasmid by homologous recombination owing to sequences that are homologous 
to sequences in me T-DNA. The Ti or ^ 

transfer of the T-DNA Intermediate vectors cannot replicate themselves in Agrobacteria. The 
intermediate vector can be transferred into Agrobacterium tumefaciens by means of a helper 
plasmid (conjugation). Binary vectors can replicate themselves both in E. coli and in 
Agrobacteria. They comprise a selection marker gene and a linker or polylinker which are 
framed by the right and left T-DNA border regions. They can be transformed directly into 
AgrobacteriaiKolstosetal. [1978] MoL Gen. Genet. 163:181-187). TteAgrobacteriumMse&ts 
host cell is to comprise a plasmid carrying a vir region. The vir region is necessary for the 
transfer of the T-DNA into the plant cell. Additional T-DNA maybe contained. The bacterium 
sotransformedisusedformetransformationofplantcells. Plant explants can advantageously be 
cultivated with Agrobacterium tumefaciens or Agrobacterium rhizogenes for the transfer of the 
DNA into the plant cell. Whole plants can then be regenerated from tire infected plant material 
(for example, pieces of leaf, segments of stalk, roots, but also protoplasts or suspension- 
cultivated cells) in a suitable medium, which may contain antibiotics or biocides for selection. 
The plants so obtained can then be tested for the presence of the inserted DNA. No special 
demands are made of the plasmids in the case of injection and electroporation. It is possible to 
use ordinary plasmids, such as, for example, pUC derivatives. 
loomj The transformed cells grow inside the plants in the usual manner. They can form germ 

cells and transmit the transformed trait(s) to progeny plants. Such plants can be grown in the 
normal manner and crossed withplants that have the same transformed hereditary factors or other 
hereditary factors. The resulting hybrid individuals have the corresponding phenotypic 
properties. 

loomj In some preferred embodiments of the invention, genes encoding the bacterial toxin are 

expressed from transcriptional units inserted into the plant genome. Preferably, said 
transcriptional units are recombinant vectors capable ofstable integration into the plant genome 
and enable selection of transformed plant lines expressing mRNA encoding the proteins. 
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[com] Once the inserted DNA has been integrated in the genome, it is relatively stable there (and 

does not come out again). It normally contains a selection marker feat confers on the transformed 
plant cells resistance to a biocide or an antibiotic, such as kanamycin, G418, bleomycin, 
hygromycin, or chloramphenicol, inter alia. The individually employed marker should 
accordingly permit the selection of transformed cells rather than cells that do not contain die 
inserted DNA. The gene(s) of interest are preferably expressed either by constitutive or inducible 
promoters in the plant cell. Once expressed, the mRNA is translated into proteins, thereby 
incorporating amino acids of interest into protein. The genes encoding a toxin expressed in the 
plant cells can be under the control of a constitutive promoter, a tissue-specific promoter, or an 
inducible promoter. 

loom) Several techniques exist for introducing foreign recombinant vectors into plant cells, and 

for obtaining plants that stably maintain and express the introduced gene. Such techniques 
include the introduction of genetic material coated onto microparticles directly into cells (U.S. 
Pat. Nos. 4,945,050 to Cornell and 5,141 ,13 1 to DowElanco, now Dow AgroSciences, LLC). In 
addition, plants may be transformed using Agrobacteriwn technology, see U.S. Pat. No. 
5,177,010 to University of Toledo; 5,104,310 to Texas A&M; European Patent Application 
0131624B1; European Patent Applications 120516, 159418B1 and 176,112toSchilperoot;U.S. 
Pat. Nos. 5,149,645, 5,469,976, 5,464,763 and 4,940,838 and 4,693 ,976 to Schilperoot; European 
Patent Applications 1 16718, 290799, 320500 all to Max Planck; European Patent Applications 
604662 and 627752, and U.S. Pat. No. 5,591,616, to Japan Tobacco; European Patent 
Applications 0267159 and 0292435, and U.S. Pat. No. 5,231,019, all to Ciba Geigy, now 
Novartis; U.S. Pat. Nos. 5,463,174 and 4,762,785, both to Calgene; and U.S. Pat Nos. 5,004,863 
and 5,159,135, both to Agracetus. Other transformation technology includes whiskers 
technology. See U.S. Pat Nos. 5,302,523 and 5,464,765, both to Zeneca. Electroporation 
technology has also been used to transform plants. See WO 87/06614 to Boyce Thompson 
Institute; U.S. Pat. Nos. 5,472,869 and 5,384,253, both to Dekalb; and WO 92/09696 and WO 
93/2 1 335, both to Plant Genetic Systems. Furthermore, viral vectors can also be used to produce 
transgenic plants expressing the protein of interest For example, monocotyledonous plant can be 
transformed with a viral vector using the methods described in U.S. Pat. Nos. 5,569,597 to 
Mycogen Plant Science and Ciba-Giegy, now Novartis, as well as U.S. Pat Nos. 5,589,367 and 
5,3 1 6,93 1 , both to Biosource. 
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[00200] 



Asmentionedpreviously.themaimer in whichtheDNA construct is introduced into the 
plantbost is not critical to this invention. Any method which provides for efficient transformation 
may be employed. For example, various methods for plant cell transformation are described 
herein and include the use of Ti or Ri-plasmids and the like to perform Agrobacterium mediated 
transformation. In many instances, it will be desirable to have the construct used for 
transformation bordered on one or both sides by T-DNA borders, more specifically the right 
border. This is particularly useful when the construct uses Agrobacterium twnefaciens or 
Agrobacteriumrhizogenes tzimodt for transformation, although T-DNA borders may find use 
with other modes of transformation. Where Agrobacterium is used for plant cell transformation, a 
vector may be used wmchmay be introducedmto me host for homologous 
DNA or the Ti or Ri plasmid present in the host. Introduction of the vector may be performed via 
electropomuon,tri-parentalma^^ 

which are known to those skilled in the art. The manner of vector transfoimation into the 
Agrobacterium host is not critical to this invention. The Ti or Ri plasmid containing the T-DNA 
for recombination maybe capable or incapable of causing gall formation, and is not critical to 
said invention so long as the vir genes are present in said host: 

Insomecuseswhere^grotatf^ 
bemg within the T-DNAborders will be inserted into abroad spectrum vector such as P RK2 or 

derivauvesthereofasd^ 

515, whichare incorporated herein by reference, mcluded within the expression construct and the 
T-DNA will be one or more markers as described herein which allow for selection of transformed 
Agrobacterium and transformed plant cells. The particular marker employed is not essential to 
this invention, with the preferred marker depending on the host and construction used. 
[00202) For transformation of plant cells using Agrobacterium, explants may be combined and 

incubated with the transformed Agrobacterium for sufficient time to allow transformation 
thereof. After transformation, the Agrobacteria are killed by selection with the appropriate 
antibiotic and plant cells are cultured with the appropriate selective medium. Once calli are 
formed, shoot formation can be encouraged by employing the appropriate plant hormones 
according to methods well known in the art of plant tissue culturing and plant regeneration. 
However, a callus intermediate stage is not always necessary. After shoot formation, said plant 
cells can be transferred to medium which encourages root formation thereby completing plant 
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regeneration. The plants may then be grown to seed and said seed can be used to establish future 
generations. Regardless of transformation technique, the gene encoding a bacterial toxin is 
preferably incorporated into a gene transfer vector adapted to express said gene in a plant cell by 
including in the vector a plant promoter regulatory element, as well as 3 ! non-translated 
transcriptional termination regions such as Nos and the like. 

[00203] In addition to numerous technologies for transforming plants, the type of tissue which is 

contacted with the foreign genes may vary as well. Such tissue would include but would not be 
limited to embryogenic tissue, callus tissue types I, n, and HI, hypocotyl, meristem, root tissue, 
tissues for expression in phloem, and the like. Almost all plant tissues may be transformed during 
dedifferentiation using appropriate techniques described herein. 

[00204] As mentioned above, a variety of selectable markers can be used, if desired. Preference 

for a particular marker is at the discretion of the artisan, but any of the following selectable 
markers may be used along with any other gene not listed herein which could function as a 
selectable marker. Such selectable markers include but are not limited to aminoglycoside 
phosphotransferase gene of transposon Tn5 (Aph II) which encodes resistance to the antibiotics 
kanamycin, neomycin and G41 8, as well as those genes which encode for resistance or tolerance 
to glyphosate; hygromycin; methotrexate; phosphinofhricin (bialaphos); imidazolinones, 
sulfonylureas and triazolopyrimidine herbicides, such as chlorsulfuron; bromoxynil, dalapon and 
the like. 

[00205] In addition to a selectable marker, it may be desirous to use a reporter gene. In some 

instances a reporter gene may be used with or without a selectable marker. Reporter genes are 
genes that are typically not present in the recipient organism or tissue and typically encode for 
proteins resulting in some phenotypic change or enzymatic property. Examples of such genes are 
provided in K. Wising et ah Ann. Rev. Genetics, 22, 421 (1988). Preferred reporter genes 
include the beta-glucuronidase (GUS) of the uidA locus of E. coli, the chloramphenicol acetyl 
transferase gene from Tn9 of E. coli, the green fluorescent protein from the bioluminescent 
jellyfish Aequorea victoria, and the luciferase genes from firefly Photinxis pyralis. An assay for 
detecting reporter gene expression may then be performed at a suitable time after said gene has 
been introduced into recipient cells. A preferred such assay entails the use of the gene encoding 
beta-glucuronidase (GUS) of the uidA locus of E. coli as described by Jefferson et al, (1987 
Biochem. Soc. Trans. 15, 17-19) to identify transformed cells. 
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In addition to plant promoter regulatory elements, promoter regulatory elements from a 
variety of sources can be used efficiently in plant cells to express foreign genes. For example, 
promoter regulatory elements of bacterial origin, such as the octopine synthase promoter, the 
nopaline synthase promoter, the mannopine synthase promoter; promoters of viral origin, such as 
the cauliflower mosaic virus (35S and 19S), 35T (which is a Te-engineered 35S promoter, see 
U.S. Pat. No. 6,166,302, especially Example 7E) and the like may be used. Plant promoter 
regulatory elements include but are not limited to ribulose-1 ,6-bisphosphate (RUBP) carboxylase 
small subunit (ssu), beta-conglycinin promoter, beta-phaseolin promoter, ADH promoter, heat- 
shock promoters, and tissue specific promoters. Other elements such as matrix attachment 
regions, scaffold attachment regions, introns, enhancers, polyadenylation sequences and the like 
may be present and thus may improve the transcription efficiency or DNA integration. Such 
elements may or may not be necessary for DNA function, although they can provide better 
expression or functioning of the DNA by affecting transcription, mRNA stability, and the like. 
Such elements may be included in the DNA as desired to obtain optimal performance of the 
transformed DNA in the plant. Typical elements include but are not limited to Adh-intron 1, 
Adh-intron 6, the alfalfa mosaic virus coat protein leader sequence, the maize streak virus coat 
protein leader sequence, as well as others available to a skilled artisan. Constitutive promoter 
regulatory elements may also be used thereby directing continuous gene expression in all cells 
types and at all times actin, ubiquitin, CaMV 35S, and the like). Tissue specific promoter 
regulatory elements are responsible for gene expression in specific cell or tissue types, such as the 
leaves or seeds (e.g., zein, oleosin, napin, ACP, globulin and the like) and these may also be 
used. 

Promoter regulatory elements may also be active during a certain stage of the plant's 
development as well as active in plant tissues and organs. Examples of such include but are not 
limited to pollen-specific, embryo-specific, com-silk-specific, cotton-fiber-specific, root-specific, 
seed-endospenn-specific promoter regulatory elements and the like. Under certain circumstances 
it may be desirable to use an inducible promoter regulatory element, which is responsible for 
expression of genes in response to a specific signal, such as: physical stimulus (heat shock 
genes), light (RUBP carboxylase), hormone (Em), metabolites, chemical, and stress. Other 
desirable transcription and translation elements that function in plants may be used. Numerous 
plant-specific gene transfer vectors are known in the art. 
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[Q0208] Standard molecular biology techniques may be used to clone and sequence the toxins 

described herein. Additional information may be found in Sambrook, J., Fritsch, E. F., and 
Maniatis, T. (1989), Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, which 
is incorporated herein by reference. 

rw209] Resistance Management . With increasing commercial use of insecticidal proteins in 

transgenic plants, one consideration is resistance management. That is, there are numerous 
companies using Bacillus thuringiensis toxins in their products, and there is concern about 
insects developing resistance to B. t. toxins. One strategy for insect resistance management would 
be to combine the TC toxins produced by Xenorhabdus, Photorhabdus, and the like with toxins 
such as B.L, crystal toxins, soluble insecticidal proteins from Bacillus stains (see, eg., WO 
98/1 8932 and WO 99/57282), or other insect toxins. The combinations could be formulated for a 
sprayable application or could be molecular combinations. Plants could be transformed with 
bacterial genes that produce two or more different insect toxins (see, e.g. t Gould, 38 Bioscience 
26-33 (1 988) and U.S. Patent No. 5,500,365; likewise, European Patent Application 0 400 246 
Al and U.S. Patents 5,866,784; 5,908,970; and 6,172,281 also describe transformation of a plant 
with two B. t crystal toxins). Another method of producing a transgenic plant that contains more 
than one insect resistant gene would be to first produce two plants, with each plant containing an 
insect resistance gene. These plants could then be crossed using traditional plant breeding 
techniques to produce a plant containing more than one insect resistance gene. Thus, it should be 
apparent that the phrase "comprising a polynucleotide" as used herein means at least one 
polynucleotide (and possibly more, contiguous or not) unless specifically indicated otherwise. 

[M2io] Formulations and Other Delivery Systems . Formulated bait granules containing spores 

and/or crystals of the subject Paenibacillus isolate, or recombinant microbes comprising the 
genes obtainable from the isolate disclosed herein, can be applied to the soil. Formulated product 
can also be applied as a seed-coating or root treatment or total plant treatment at later stages of 
the crop cycle. Plant and soil treatments of cells maybe employed as wettable powders, granules 
or dusts, by mixing with various inert materials, such as inorganic minerals (phyllosilicates, 
carbonates, sulfates, phosphates, and the like) or botanical materials (powdered corncobs, rice 
hulls, walnut shells, and the like). The formulations may include spreader-sticker adjuvants, 
stabilizing agents, other pesticidal additives, or surfactants. Liquid formulations maybe aqueous- 
based or non-aqueous and employed as foams, gels, suspensions, emulsifiable concentrates, or 



WO 2004/002223 PCT/US2003/020082 

48 

the like. The ingredients may include rheological agents, surfactants, emulsifiers, dispersants, or 
polymers. 

[00211] As would be appreciated by a person skilled in the art, the pesticidal concentration will 

vary widely depending upon the nature of the particular formulation, particularly whether it is a 
concentrate or to be used directly. The pesticide will be present in at least 1% by weight and may 
be 1 00% by weight. The dry formulations will have from about 1 -95% by weight of the pesticide 
while the liquid formulations will generally be from about 1-60% by weight of the solids in the 
liquid phase. The formulations will generally have from about 1 0 2 to about 1 0 4 cells/mg. These 
formulations will be administered at about 50 mg (liquid or dry) to 1 kg or more per hectare. 

loom] The formulations can be applied to the environment of the pest, e.g. , soil and foliage, by 

spraying, dusting, sprinkling, or the like. 

[00213] Another delivery scheme is the incorporation of the genetic material of toxins into a 

baculovirus vector. Baculoviruses infect particular insect hosts, including those desirably 
targeted with the toxins. Infectious baculovirus harboring an expression construct for the toxins 
could be introduced into areas of insect infestation to thereby intoxicate or poison infected 
insects. 

[00214] Insect viruses, or baculoviruses, are known to infect and adversely affect certain insects. 

The affect of the viruses on insects is slow, and viruses do not immediately stop the feeding of 
insects. Thus, viruses are not viewed as being optimal as insect pest control agents. However, 
combining the toxin genes into a baculovirus vector could provide an efficient way of 
transmitting the toxins. In addition, since different baculoviruses are specific to different insects, 
it maybe possible to use a particular toxin to selectively target particularly damaging insect pests. 
A particularly useful vector for the toxins genes is the nuclear polyhedrosis virus. Transfer 
vectors using this virus have been described and are now the vectors of choice for transferring 
foreign genes into insects. The virus-toxin gene recombinant may be constructed in an orally 
transmissible form. Baculoviruses normally infect insect victims through the mid-gut intestinal 
mucosa. The toxin gene inserted behind a strong viral coat protein promoter would be expressed 
and should rapidly kill the infected insect. 

[0021S] In addition to an insect virus or baculovirus or transgenic plant delivery system for the 

protein toxins of the present invention, the proteins may be encapsulated using Bacillus 
thuringiensis encapsulation technology such as but not limited to U.S. Pat. Nos. 4,695,455; 
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4,695,462; 4,86 1 ,595 which are all incorporated herein by reference. Another delivery system for 
the protein toxins of the present invention is formulation of the protein into a bait matrix, which 
could then be used in above and below ground insect bait stations. Examples of such technology 
include but are not limited to PCT Patent Application WO 93/23998, which is incorporated 
herein by reference. 

[002J6] Plant RNA viral based systems can also be used to express bacterial toxin. In so doing, 

the gene encoding a toxin can be inserted into the coat promoter region of a suitable plant virus 
which will infect the host plant of interest. The toxin can then be expressed thus providing 
protection of the plant from insect damage. Plant RNA viral based systems are described in U.S . 
Pat. Nos. 5,500,360 to Mycogen Plant Sciences, Inc. and U.S. Pat Nos. 5,3 1 6,93 1 and 5,589,367 
to Biosource Genetics Corp. 

[00217] In addition to producing a transformed plant, there are other delivery systems where it 

may be desirable to reengineer the bacterial gene(s). For example, a protein toxin can be 
constructed by fusing together a molecule attractive to insects as a food source with a toxin. 
After purification in the laboratory such a toxic agent with "built-in" bait could be packaged 
inside standard insect trap housings. 

[00218] Mutants . Mutants of the DAS1 529 and DB482 isolates of the invention can be made by 

procedures that are well known in the art. For example, an asporogenous mutant can be obtained 
through ethylmethane sulfonate (EMS) mutagenesis of an isolate. The mutants can be made 
using ultraviolet light and nitrosoguanidine by procedures well known in the art. 

[00219] All patents, patent applications, provisional applications, and publications referred to or 

cited herein are incorporated by reference in their entirety to the extent they are not inconsistent 
with the explicit teachings of this specification. 

[00220] Following are examples that illustrate procedures for practicing the invention. These 

examples should not be construed as limiting. All percentages are by weight and all solvent 
mixture proportions are by volume unless otherwise noted. 



Example 1 - Isolation and Discovery of Insecticidal Activity of DAS 1 529 as a Paenibacillus sp. 
[00221] A bacterial strain, designated DAS 1 529, was found to produce factors that were growth 
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(00222] 



[00223] 



[00224] 



inhibitory to neonates of lepidopteran insects, corn earworm (Heliothis tea; CEW), tobacco 
budworm (Heliothis virescens; TBW), and tobacco homworm (Maiiduca sexta; THW). 

DAS1529 was cultured in 2% Protease PeptoneNo. 3 (PP3) medium (Difco Laboratories, 
Detroit, MI) supplemented with 1 .25% NaCl or in JB medium supplemented with 0.2% glucose. 
Bacterial culture was grown at 25° C for -40 hours at 150 rpm. 

The insecticidally active factors were initially found in the fermentation broth that was 
concentrated on 5 kDa molecular weight cutofffilters. Those factors were heat labile (inactivated 
by heating at 85° C for 20 minutes). These data indicated that the factors were proteinaceous. 

See also end of Example 4. 

Toidentify active factors in cell pellets, the bacterial culture was centrifugedatoOOOrpm 
at 4° C for 1 5 minutes, washed once with sterile distilled water, and resuspended to 33X of the 
original culture volume in sterile distilled water, and subjected to insect bioassay as described 
belowinExample3. Thebioassay^taforDAS1529st ra missummarizedinTable4. Thedata 
showed lhat the culturebroth and concentrated DAS 1529 bacterial cells conferred good activity 
against CEW (30 to 50% mortality at 33X) and TBW (100% mortality at 33X). Those toxin 
factors in DAS1529 have significant relevance to the development of commercial transgenic 
products targeting lepidopteran insects (e.g. CEW and TBW) in corn and cotton. 



Table 4 
Bioassay of DAS 1529 Strain 



[00225] 



Insects TBW CEW Grubs 

Broth Activity +++ +++ n.d. 

Pellet Activity +++ I 

•n.d. - not determined; -, ++, +++, no, moderate and higji 
activity, respectively 

Exam ple 2 -^nssificatioT i of DAS 1529 

Molecular phylogeny was performed to determine the taxonomic affiliation of strain 
DAS1529.Thenucleotidesequenceofthel6SrDNAofDAS 1529 was determined and used for 
similarity and phylogenetic analyses (using the MicroSeq Kit from ABI). The sequence is 
provided as SEQ ID NO:16. BLAST search results are as follows: 

Core E 
(bits) 

gij^5395282|enib|AJ320490.l|PTH320490 Paenibacillus thiamino . . . 2906 
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gi | 3328014 |gb|AF071859.l|AF071859 
O.O 

gi| 3328015 |gb|AF071860.l|AF071860 
0.0 

gi | 2769591 1 emb | Y16129 . 1 1 PS16SC168 
0.0 

gi | 2769590 | emb | Y16128 . 1 | PS16ST168 
0.0 

gi | 2077917 |dbj | D78475 . 1 | D78475 
0.0 

gi | 3328016 | gb | AF071861 . 1 1 AF071861 
0.0 

gi I 2895560 I gb I AF03 94 08 . 1 1 
0.0 

gi I 2077936 | dbj | D88513 . 1 1 D88513 
0.0 



Paenibacillus popilliae s. . 

Paenibacillus popilliae s.., 

Paenibacillus sp. C-168 1... 

Paenibacillus sp. T-168 1... 

Paenibacillus 
t hiarainolyt i cu._ 

Paenibacillus lentimorbus. . . 

Bacillus tipchiralis 16S 
ribosoma . . . 

Paenibacillus 
thiaminolyticu. . . 



gi|l5395283|emb|AJ320491.l|PAL320491 Paenibacillus alvei pa... 
0.0 



2834 
2815 
2699 
2509 

2503 
2493 

2493 

2493 
2404 



[00226] 



These same top scoring sequences from the BLAST search were also compared using the 
Gap routine (Needleman and Wunsch, J. Mol. Biol. 48; 443-453 (1970)) from GCG version 
1 0.2, with the following results: 



%Sim 

gi| 15395282) emb |AJ320490.l|PTH320490 Paenibacillus thiamine . . 

Paenibacillus popilliae s. 



gi|3328014|gb|AF071859.l|AF071859 
99.6 

gi |3328015|gb|AF071860.l|AF071860 
99.3 

gi I 2769591 1 emb | Y16129 . 1 1 PS16SC168 
97.3 

gi I 2769590 1 emb | Y16128 . 1 1 PS16ST168 
97.4 

gi] 2077917 I dbj | D78475 . 1 |D78475 
98.1 

gi|3328016|gb|AF071861.l|AF071861 
98.9 

gi|2895560|gb|AF039408.l| 
96.9 

gi 1 2 07793 6 I dbj | D8B513 . 1 1 D88513 



Paenibacillus popilliae s. 

Paenibacillus sp. C-168 1., 

Paenibacillus sp. T-168 1. . 

Paenibacillus 
thiaminolyt i co.. 

Paenibacillus lentimorbus . . 

Bacillus tipchiralis 16S 
ribosoma . . . 

Paenibacillus 
thiaminolyticu. . . 



%Ident 

99.2 99.6 
99.2 

99 . 2 

97.1 

97.4 

96.5 
98.8 

96.0 



96.7 
95.2 



[00227] 



98.7 

gi|l5395283|emb|AJ320491.l|PAL32 0491 Paenibacillus alvei pa... 
95.3 

[%Ident, matches of unambiguous bases; %Sim, %Ident plus potentially 
matching ambiguous bases] 

A number of related sequences, including the top scoring sequences noted above, were 
also trimmed and aligned as described by Shida et al {Int. J. Syst. Bacteriol 47:289-298, 1997), 
using the sequence alignment program CLUSTAL W (Thompson, J.D., D. G. Higgins, and T J. 
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Gibson, Nucleic Acids Res. 22:4673-4680, 1994). The results clearly place DAS1529 in the 
Paenibacillus popilliae/Paenibacillus lentimorbus subcluster of the genus Paenibacillus 
identified by Pettersson era/. (Int. J. Syst. Bacterial. 49:531-540, 1999), and are consistent with 
the analyses reported above. This subcluster includes the insect-associated species P. popilliae 
and P. lentimorbus, as well as P. thiaminolyticus, Paenibacillus sp. T-168 and C-168, and 
"Bacillus tipchiralis," which are not known to have an insect association (Pettersson et al, 1999). 
As noted by Wayne et al. (Int. J. Syst. Bacterid. 37:463-464, 1987) and Vandamme et al. 
(Microbiol. Rev. 60:407-438), rDNA sequences that are greater than 97% identical cannot 
generally be used to assign a bacterial strain to a particular species in the absence of additional 
information. In the case of DAS1529, insecticidal activity on lepidoptera and evidence of a 
thiaminase are not consistent with known P. popilliae and P. lentimorbus, and the insect 
association is not consistent with known P. thiaminolyticus (as well as the other subcluster 
species). 

[00228] As other Paenibacillus strains are known causative agents of milky disease in larvae of 

Japanese beetles (Popillia jalonica; Harrison et al, 2000), the DAS1529 was tested for activity 
on June beetles, a relative of Japanese beetles. No activity was found for cultures grown in JB 
and PP3 medium. Microscopic examination of those cultures revealed even-colored rods with no 
visible sporulation or parasporal crystals present. We are able to show DAS1529 can sporulatein 
defined medium and culture conditions and within the hemolymph of Manducca sexta. It is 
known that the Japanese beetle active Paenibacillus strains are typically associated with 
paraspore and parasporal bodies (Harrison et al, 2000). 

[00229] Additional work will be needed to determine whether DAS1 529 belongs to an existing 

species or should be assigned a new species designation. 

Exam ple 3 - Insect Bioassav Methodology 
[00230] Two insect bioassay methods were used to obtain results presented below. A 96-well 

foimat and a 128-well format were used forprimary screening for activity against lepdidopteran 
insects. A 24-well diet incorporation format was used to determine specific activity (LC50s) of 
the toxin. 

[00231] For the 96-well format, artificial diet was dispensed into 96-well microtiterplates. Each 
well measured approximately 0.32 cm 2 and contained 150 ^artificial diet Samples/toxins were 
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applied at a rate of 50 ^1 /well for fermentation broth, cell pellets, and purified toxins. Positive 
control (Cryl Ac) at appropriate doses and negative controls (water, medium blank, bacterial 
host strains not expressing target toxin) at top dose were included. Samples were allowed to dry 
for approximately 1-3 hours so that the samples lost their moisture but the diet retained its 
moisture. Either insect eggs were dispersed onto the surface of the sample treated diet, or a 
single insect larva was seeded per well. The infested plate was sealed either with iron-on mylar 
covering or covered with sticky lidding with perforations. Tiny air holes were made in the mylar 
covering to ensure air supply to the insects. The plates were incubated at 28° C for 5 days and 
scored for mortality and stunting. This was done on a per-well basis, ignoring the number of 
larvae per well, as multiple eggs are often deposited per well. Activity scores were then 
assigned to each treatment: 0 = no activity, larvae healthy like water control wells, 1= larvae 
were stunted, or stunted with some mortality, 2 = larvae were all dead. 
[00232] The specific activities (LC50s) of samples/toxins were determined by diet incorporation 

bioassay in 24-well Nutrend trays (Nu-Trend™ Container Corp., Jacksonville, FL). Insect 
artificial diet was made just prior to use and held in liquid state at 55° C in a water bath. Serial 
dilutions (> 5) were made by mixing 27 ml of artificial diet with no more than 3 ml of 
samples/toxins. A total of 30 ml sample and diet mixture was vortexed for 30 seconds and then 
evenly distributed into each tray, filling ~ 50% of the well volume. Trays were allowed to cool 
for at least 30 minutes prior to infesting. One test insect was infested into each well, and clear 
mylar was sealed over the top of each tray to contain the insects. Small holes were punched 
with an insect pin in the mylar over each well for air circulation. Assays were generally held at 
25° C for 6 days but some may have been held at 30° C for 4 days if quicker results were 
needed. A set of positive and negative controls was run for each assay. Assays were graded on 
the basis of mortality but data on stunting was also recorded. Statistical methods were used to 
estimate LC50s for assayed samples and was expressed as ng or /xg/ml diet 

Example 4 - Biochemical Purification and Characterization of Insecticidal Toxins from 
DAS 1529 Fermentation Broth - Thiaminase 
(00233] The fermentation broths of DAS 1 529 contained insecticidal activity against lepidopteran 

species, such as tobacco budwonn, corn earworm, and tobacco hornworm. The nature of the 
insecticidal activity was investigated by biochemical purification and characterization. Com 
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earworm bioassay, as described in Example 3, was used during the purification process to Mow 
insecticidal activities. 

FennentationbrothsofDAS1529 were produced using2%PP3 supplemented with 1.25% 
NaCl and processed as described in Example 1 . Four liters of broth was concentrated using an 
Amicon (Beverly, MA) spiral ultrafiltration cartridge Type S1Y10 (molecular weight cut off 10 
kDa) attached to an Amicon M-12 filtration device according to the manufacturer's 
recommendations. The retentate was diafiltered with 20 mM sodium phosphate, pH 7.0 (Buffer 
A) and applied at 5 ml/min to a Q cepharose XL anion exchange column (1.6 x 10 cm). The 
column was washed with 5 bed volumes of Buffer A to remove unboundproteins. Toxin activity 
was eluted by 1 .0 M NaCl in Buffer A. 

The fraction containing the insecticidal activity was loaded in 20 ml aliquots onto a gel 
filtration column Macro-Prep SE1000/40 (2.6 x 100 cm) which was equilibrated with Buffer A. 
The protein was eluted in Buffer A at a flow rate of 3 ml/min. Fractions with activity against 
com earworm were pooled and were applied to a MonoQ (1 .0 x 10 cm) column equilibrated with 
20 mM Tris-HCl, pH 7.0 (Buffer B) at a flow rate of 1 ml/min. The proteins bound to the column 
were eluted with a linear gradient of 0 to 1 M NaCl in Buffer B at 2 ml/min for 60 min. Two 
milliliter fractions were collected and activity was determined as described in Example 1. 

Solid (NH4)2S04 was added to the above active protein fractions to a final concentration 
of 1 .7 M. Proteins were then applied to a phenyl-Superose (1 .0 x 10cm) column equilibrated 
with 1.7 M (NH4)2S04 in 50 mM potassium phosphate buffer, pH 7 (Buffer C) at 1 ml/min. 
After washing the column with 10 milliliters of Buffer C, proteins bound to the column were 
eluted with a linear gradient Buffer C to 5 mM potassium phosphate, pH 7.0 at 1 ml/min for 120 
min. The most active fractions determined by bioassay were pooled and dialyzed overnight 
against Buffer A. 

[Bom] The dialyzed sample was applied to a Mono Q (0.5 x 5 cm) column which was 

equilibrated with Buffer B at 1 ml/min The proteins bound to Ihe column were eluted at 1 
ml/min by a linear gradient of 0 to 1 M NaCl in Buffer B. The active fractions were pooled and 
adjusted to a final (NH4)2S04 concentration of 1.7M. Proteins were then applied to a phenyl- 
Superose (0.5.0 x 5cm) column equilibrated with Buffer C at 1 ml/min. Proteins bound to the 
column were eluted with a linear gradient of Buffer C to 5 mM potassium phosphate, pH 7.0 at 
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0.5 ml/min for 40 min. The purified fractions were pooled and dialyzed overnight against Buffer 
A. 

[00238] The molecular weight of the final purified toxin was examined by a gel-filtration column 

Superdex S-200. The toxin exhibited a native molecular weight of approximately 40 kDa. SDS- 
PAGE of the purified toxins showed a predominant band of approximately 40 kDa. This 
suggested that the native DAS 1529 toxin (in this fraction) was an approximately 40 kDa 
monomer. 

[00239] The purified toxin was electrophoresed in 4-20 % SDS-PAGE and transblotted to PVDF 

membrane. The blot underwent amino acid analysis and N-tenninal amino acid sequencing (SEQ 
3D NO. 1 7). Searching protein database (NCBI-NR) using the sequence as a query revealed that 
it is 95% identical to the approximately 42 kDa thiaminase I from Bacillus thiaminolyticus 
(Campobassoe/tf/., 1 998; GENBANK Accession No. 2THIA; SEQ ID NO: 18). Partial sequence 
alignments are illustrated in Figure 3, which would be the same alignment with GENBANK 
Accession No. AAC44156 (thiaminase I precursor; U17168 is the corresponding entry in 
GENBANK for the DNA, which could be expressed to get a thiaminase produced and secreted 
from a bacterial cell). The purified thiaminase from DAS 1529 was tested on corn earwonn 
(CEW), the results were shown in Figure 4. This toxin did not kill com earworm (up to a 
concentration of 8pg/cm 2 ) but exhibited 95% growth inhibition at a concentration as low as 5 
ng/cm 2 . It was also found that the purified thiaminase was not deactivated by proteinase K 

Example 5 - Cloning of Genes Encoding Insecticidal Factors Produced by DAS 1529 
[00240] In an attempt to clone the nucleotide sequence(s) that encode the insecticidal factor(s) 

produced by DAS 1 529, a cosmid library was constructed using total DNA isolated from DAS 
1 529 and was screened for insecticidal activity. Six recombinant cosmid clones were identified 
that produced insecticidal activity against corn earworm and tobacco budwonn neonates. Three 
of the cosmid clones produced heat labile (when heated at 85° C for 20 minutes) fectors that 
resulted in insect mortality. The other three cosmid clones produced heat labile factors that were 
growth inhibitory to insects. One of the cosmids that produced insect mortality, designated as 
cosmid SB 12, was chosen for nucleotide sequence analysis. 
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00241] 
V0242] 



[00243] 
[00244] 



A PnTistruction of a cosmid lib rary of DAS1529. 

Total DNA was isolated from DAS1529 with a DNA purification kit (Qiagen Inc., 
Valencia CA) Vector and insert DNA preparation, ligation, and packaging, followed 
instrucuonsf^ 

DNA fragments were cloned into the BamHL site of SuperCos 1 cosmid vector. The hgated 
product was packaged with the Gigapack® III gold packaging extract and transfected into host 
cellsXLl-BlueMRF. Transformants were selected on LB-kanamycin agar plates. Thecosmid 
library consisted of 960 randomly picked colonies that were grown in ten 96-well microliter 
plates in 200 ul LB-kanamycin (50 ^g/ml) for insect activity screening and long term storage. 
P g^nin r of DAS1 5? Q ™smiri library for insecticidal activity. 
For the primary screening for clones active against lepdidopteran insects (CEW and 
TBW), a total of 960 cosmid clones as single colonies were grown in 2 ml cultures in 96 well 
plates ' Cultures were spun and re-suspended in original culture media at approximately 10 X 
concentration and submitted to bioassay. The SuperCos 1 vector (SB1) was included as a 
negative control. Sixteen positive clones (SB2 to SB17) were isolated from the first round of 

screening. Second and third rounds of screening were carried out to screen for activity against 

TBW and CEW; one cosmid clone (SB12) consistently showing the highest activity was chosen 

for further analysis. Table 5 summarizes the activity spectrum (as tested) of the SB12 cosmid. 

(BAW is beet annyworm, Spodoptera exigua; ECB is European cornborer, Ostrinia nubilalis; 

SCRW is Southern corn rootworm, Diabrotica undecimpucata howardi.) Thebroth of SB12 E. 

coli culture both contained no CEW activity; hence, the active factors in SB12 were different 

from the active factors in DAS1529 strain culture broth. 



Table 5 
Bioassav of SB12 E. coli Clone 
TBW CEW ECB BAW 



Grubs 
n.d. 



SCR 
n.d. 




[00245] 



r sequencing "f SB12 co s mid insert and identification of tc- and cry-like ORFs. 
Nucleotide sequencing of cosmid SB12 showed that it contained a genomic insert of 
approximately 34kb. Analysis of this sequence surprisingly revealed the presence of at least 10 



I 
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putative open reading frames (ORFs) {see Figure 2). Six of the identified ORFs were surprisingly 
found to have some degree of amino acid sequence identity (38-48%) to tcaA, tcaB, tcaC, and 
tccC previously identified from Photorhabdus luminescens (Waterfield et aL, 2001), 
Xenorhabdus nematophilus (Morgan et aL, 2001), Serratia entomophila (Hurst and Glare, 2002; 
Hurst et aL, 2000), and Yersinia pestis (Crbnin et aL, 2001). Those TC protein genes from 
Photorhabdus, Xenorhabdus, and Serratia have been shown to encode insecticidal factors. Also 
very interesting was that one DAS 1 529 ORF had ~ 40% amino acid sequence identity to Cry\ Ac 
from Bacillus thuringiensis, another gene previously identified as an insecticidal factor (Schnepf 
et aL, 1998; de Maagd et aL, 2001). Those findings have significant implication in understanding 
toxin gene distribution in the bacterial kingdom and in developing further strategies for toxin 
gene mining and engineering. 
[00246] The nucleotide sequence of the SB 12 cosmid was determined. The assembled DNA of 

41,456 bp long was further analyzed. Three gaps were located: two in the cosmid vector and one 
in the insert. Analysis of the nucleotide sequence of the longest contig of approximately 34,000 
bp revealed the presence of at least 10 putative open reading frames (ORFs), identified as 
potential start codons followed by extended open reading frames. This method is known to mis- 
identify translation^ start sites by 19% {Bacillus subtilis) and 22% {Bacillus halodurans) in 
genomes related to Paenibacillus (Besemer, J., Lomsadze, A., Borodovsky, M., Nucleic Acids 
Res. 29:2607-261 8, 2001). Therefore, the quality and position of bases complementary to the B. 
subtilis 16S rRNA 5' end (reviewed in Rocha, E.P.C., Danchin, A., Viari, A., Nucleic Acids Res. 
27:3567-3576, 1999), N-terminal amino acid sequencing, and alignments to related genes were 
considered in identifying the native translation initiation sites. The putative ORFs and 
annotations are summarized in Table 6 and are discussed in more detail below. 



Table 6 

Sequence annotation for SB12 cosmid sequence 


SEQID 
NO: 


Some ORF 
similarity to: 


ORF Designation 
on SB12 


Comments 


Sequence Location 
on SEQ ID NO:l 


1 






Entire insert of SB1 2 


(1-33521) 


2 


tcaA 
(truncated at 
5") 


ORF1 




1-3264 


3 






Translation of ORF1 


(1-3261) 


4 


tcaB 


ORF2(withIS 




3271-4740 (5' end); 
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element removed) 




6079-8226 (3 'end) 


5 






Translation of ORF2 
(without insertion) 
fiom5'-mostATG 


(amino acids 1-490/ 
491-1205) 


i 6 


tcaA 


ORF3 




9521-12820 


i 7 






Translation of ORF3 


(9521-12817) 


! 8 


tcaB 


ORF4 




12827-16453 


9 






Translation of ORF4 
from 5 -most ATG 


(12827-16450) 


10 


tcaC 


ORF5 




16516-20850 


11 






Translation of ORF5 


(16516-20847) 


1 12 


tccC 


ORF6 




20867-23659 


13 






Translation of ORF6 
(from better RBS) 


(20867-23656) 


14 




ORF7 (Cryl529) 




24422-26213 


15 






Translation of ORF7 




19 


tccC 




Translation from 5- 
most ATG of ORF6 


20798-23656 



ORF1 begins with the first nucleotide of the cloning site for the DAS1529 DNA in 
cosmid SB12, and is therefore missing its native translation initiation site. ORF1 shares 
significant DNA sequence homology with ORF3, and sequence comparison analysis suggests the 
first 18 bp of ORF1 is truncated, and that the first six codons encode the amino acids Met-Val- 
Ser-Thr-Thr, as found in OFR3. The ORF1 translation start is presumably similar to that of 
ORF3, from approximately bases 9505 through 9523 of SEQ ID NO: 1 . Two predicted amino 
acid sequences are presented for ORF2, ORF4, and ORF6 (SEQ ID NOs: 19 and 13), based on 
alternative translation initiation sites, as noted above. For ORF2, SEQ ID NO:5 is discussed 
above. The alternate, and preferred, start site is at residue 3295 of ORF1. Thus, the protein 
resulting from this start site would begin at amino acid residue 9 of SEQ ID NO: 5 (translation 
from better RBS). Likewise, for OKF4, SEQ ID NO:9 is discussed above. The alternate, and 
preferred, start site is at residue 12,852 of SEQ ID NO: I . The resulting protein would also be 
missing the first eight amino acids of SEQ ID NO:9 (thus beginning with amino acid residue 9 of 
SEQ ID NO: 8 - translation from better RBS). 

Example 6 - Sequence Analysis of "Duplicated" TCs_, 

The degree of sequence identity for the two ORF2 fragments (tcaBi) compared to ORF4 
(tcaB 2 ) was determined, as was that for ORF1 (tcaAi) compared to ORF3 (tcaA 2 ). A similar 
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sequence relationship was observed for both pairs of ORFs. 
[00249] ORF2 was constructed by combining two fragments, because of an insertion sequence- 

like element which was inserted in nature (apparently spontaneously), and disrupted this ORF. 
See Figure 2. The location of this insertion is determinable by analyzing/comparing the entire 
SB12 DNA sequence (SEQ ID NO:l) with the sequence of SEQ ID NO:4, the latter of which 
does not reflect the (non-coding) insertion. As indicated with brackets in Figure 7, the sequence 
of the 5 ' translation product prior to residue 490 of SEQ ID NO:4 and the 3 ' translation product 
from residue 491 on,alignwell withORF4 (SEQIDNO:8). The DNA sequence at the apparent 
insertion point shows a 9bp direct repeat commonly found flanking insertion elements 
(CACCGAGCA, bases 4734^742 and 6071-6080 of SEQ ID NO:l). 

Example 7 ~ Further Sequence Analysis 

[00250] In summary, according to Vector NTT clustalW, GCG, and/or Blastp analyses, six of the 

identified ORFs (ORF1 to ORF6) had 38-48% amino acids sequence identity to tcaA, tcaB, tcaC, 
and tccC (previously identified Photorhabdus tc genes). The ORF7 encoded a protein that shared 
-40% amino acid sequence identity to CrylAc from Bacillus thuringiensis, another gene 
previously identified as an insecticidal factor. A phylogram was generated by incorporating 
ORF7 (Cry 1529) sequence with a large number of other Cry proteins (Figure 8). This 
phylogenetic tree suggests that Cryl529 is distantly related to other P. popilliae Cry sequences 
such as the Cry\ 8s (Zhang et al, 1997, Zhang et a/., 1998) that are closer to Cryls; Cry\ 529 falls 
(remotely but most closely) into a group of Cry proteins containing commonly found lepidoptera 
(Cryl, Cry9), coleoptera (Cry3, Cry8, Cryl), and diptera (Oy4) toxins, which is adistinct group 
compared to those including nematode toxins Cry5, -12, -13, -14, and -21 and Cry2, -18. 

[00251] It was a surprising and novel discovery to find Cry and TC protein genes (in the SB12 

genomic insert) in Paenibacillus. The identification of new Cry and TC protein genes has 
relevance to the art's understanding of Photorhabdus and Xenorhabdus, and Bacillus 
thuringiensis, and broadens the scope of bacterial genera in which Cry and TC protein genes have 
been found. The size of the full-length Cryl 529 identified herein corresponds to the core toxin of 
Cryl s; Cryl 529 represents a new class of Cry proteins which also has implications for isolating 
further cry genes from Bacillus thuringiensis and Paenibacillus. 
[00252] To verify that these surprising observations were not the result of strain contamination 
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( /e ,toconfnm1i»t&e34kbins^ 

of DAS1529), molecular analysis was carried out by Southern blot hybridization andPCR. For 
PCR validation, ORF6 (tecC-Kke) and ORF7 (Oyl529) specific primers (Example 8, Table 8) 
wereU sedtoamplifyORF6andORF7fr^^ 

PCR amplifications were performed on a PE9600 thennal cycler (Perkm Elmer) with the 
followmgparamet^srim^ 

at95°Cfor 30 seconds, annealing at 60»C for 45 se^nds, extension at 72»C for 2 ntinutes.arxd 
a final extension for 1 0 minutes at 72° C. For ORF7, amplification parameters were the same as 
ORF6, except the annealing temperature was 55° C for 30 seconds and extension at 72° C for 4 
minutes. Specific PCR products with a single band of expected sizes were amplified for both 
ORF6 and ORF7. 

Mtial southern blot hybridization was based on partial SB 12 DNA sequence and earned 
outaccoromgtostandardp^ 

DAS1529 from two independent preparations, SB12 cosmid DNA, and one negative control 
DNA sample from NCI (Photorlutbdus). Both DAS1529 DNA samples were 16S rDNA 
sequence confirmed to be of Paenibacillus sp. origin, and one was originally used for cosrmd 
library construction; the other was a new preparation. DNA samples were digested with EcoBl, 
blottedonto membrane, and hybridized with Roche DIG System (Roche) labeled 180 bp of PCR 
product amplified outofSB12. The PCRprimers are 5' CCTCACTAAAGGGATCACACG 

GS'annealingpartially to the vector and truncated ORF1 (compared to full-length ORF3), and 5' 
GGC TAA TTG ATG AAT CTC CTT CGC 3' annealing to the truncated ORF1 (fc*4-like) and 
full length ORF3 (/catlike). A total of three DNA fragments (0.85, 2.7, and 8.0 kb) hybridizing 
tothePCRprobe were detected, 0.85 and S.Oin the SB12and2.7and8.0inDAS1529DNAs. 
No signals were detected in the negative control. The 0.85 kb (from first Eco*I ORF1 internal 
fagm enttofirst£coRI^^ 

Ecom site in ORF1) matched the calculated sizes of the target DNA fragments from SB12. 
Detection of the 2.7 kb fragment suggests the presence of an EcoKl site 2.7 kb immediately 
upstream of the first £coRI site within ORF1 inDAS1529 DNA. Those results show that the 
SB12insert was from DAS1529 total DNA and, based on hybridization and restriction analysis, 
all copies of the ORFs were accounted for. 
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Example 8 - C haracterization of Insecticidal Activities of Proteins Encoded by SB12 Cosmid 
ORFs 

[00254] Random transposon insertional mutagenesis (to disrupt an individual ORF or an entire 

operon) and heterologous expression (expressing individual ORFs or entire operons) were 
undertaken to isolate individual ORF(s) or operons conferring the insecticidal activities in the 
SB12 cosmid. 

[00255] A. Random transposon mutagenesis of SB 12 cosmid 

[00256] A Tn mutagenesis library was generated from DAS1529 cosmid SB12 using the GPS-1 

Genome Priming System (New England BioLabs, Beverly, MA) following the kit instructions. 
Briefly,2nl 10X GPS buffer, l^lpGPS2.1 Donor DNA (0.02 ng), 1^1 SB12 cosmid (0.1 ^g)and 
1 8 pJ sterile H 2 0 were added to a 0.5 ml tube. One \A of TnsABC Transposase was added; the 
mixture was vortexed and then spun briefly to collect the materials at the bottom of the tube. This 
reaction mixture was incubated for 10 minutes at 37° C. One jd of Start Solution was added and 
mixed by pipetting up and down several times. The reaction was incubated at 37° C for 1 hour 
and was then heat inactivated at 75° C for 1 0 minutes. One of the reaction mixture was diluted 
10-fold with sterile H 2 0; 1 |il of the diluted reaction was electroporated into 100 \£L of Electro 
MAX DH5a-E E. coli (Gibco BRL, Rockville, MD). After 1 hour of outgrowth in SOC medium 
at 37° C, 10 jul or 100 jd were plated on LB agar plates containing 20 jag/ml Kanamycin and 1 5 
jig/ml chloramphenicol, followed by incubation overnight at 37 °C. 

[00257] Individual colonies from the SB12 Tn mutagenesis were streaked onto fresh LB agar 

plates containing 20 jug/ml Kanamycin and 1 5 ng/ml chloramphenicol. From the streaks, 50 ml 
cultures of LB containing 20 |ug/ml Kanamycin and 15 (ig/ml chloramphenicol were inoculated 
and grown at 28°C, 200 rpm for 48 hours. The cells were then collected by centrifiigation at 
3500 rpm for 20 minutes. The supernatant was removed and the pellet resuspended in 2.5 ml of 
the culture supernatant for a 20X concentration. The concentrated cell pellet was then assayed 
for activity against corn earworm. Forty jul of the 20X concentrate was surface applied to 
artificial diet using 8 wells per sample in 128 well plates. Newly hatched corn earworm larvae 
were added and allowed to feed for 5 days, at which time mortality and weights were recorded. 
[00258] A total of 1 84 clones were tested for loss of activity against corn earworm. The results 

are summarized in Table 7. Bioassay of Tn clones revealed that a Tn insertion in the Cryl 529 
gene results in complete loss of activity. Initial bioassay showed that the activities of clones 
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which carried Tn insertions in the tc genes were variable. Further analysis of those clones in 
which cultures were all normalized to the same cell density prior to bioassay showed no loss of 
activity as compared to SB12. Results from Tn analysis suggest that ORF7(Cryl 529) is the key 
insecticidally active component of SB12 cosmid. 



Table 7 

Winassa v of SB12. Cn>1529 and tc to inserti on E. coli Clones 
Insects TBW CEW THW GrubsSCRW . 

SB12 ++ +++ ' n,d 

TninCryl529 

Tnintcs __ _ . — — — 

» n.d - not determined; -, ++, +++, no, moderate and high activity, respectively 



[tms9] R Heterologous expression SBJ 2 ORFs/ operon. 

100260) Cry\529 (ORF7) and five tc ORFs (see Table 8 below) were expressed in pETlOlD® 

system. &eFignre5. This expression vector has all the attributes of the basic T7-regulatedpET 
expression system (Dubendorff and Studier, 1991; Studier and Moffatt, 1986) and allows 
directional cloningofablunt-endPCRproductintoavector for high-level, regulated expression, 
andsimplifiedproteinpurificationin£. coli. OptimalPCR amplification employed high-fidelity 
PfuTurbo™ DNA polymerase that is highly mermostable and possesses a 3' to 5' exonuclease 
proofreading activity to correct nucleotide-misincorportaion errors (Stratagene, La Jolla, CA). 
When TheimalAce™ polymerase (Invitrogen) is used, point mutations were introduced in the tc 
ORFs, which were corrected by the PfuTurbo™ based Quick-Change™ XL site-directed 
mutagenesis kit (Stratagene). The E. coli strain BL21 Star™ PE3), was used as a host for 
expression since it contains the rnelSl mutation (Lopez et al, 1999) that generally enhances 
mRNA stability and the yield of the recombinant proteins. 

mdividual ORFs were PCR amplified out of the SB1 2 cosmid with ORF specific primers 
(Table 8) under defined conditions. As a directional cloning requirement, the forward PCR 
primers were designed to contain the sequence, CACC, at the 5' end to ensure PCRproductbase 
pair with the overhang sequence, GTGG, in the pETlOLD vector. The reverse primers when 
paired with forward primers will amplify each ORF, respectively. PCR reactions were carried 
out in 50 ul reaction mixture containing of 50 ng of SB12 cosmid DNA, IX PJu reaction buffer 
(Stratagene), 0.2mM each of dNPT, 0.25 mM of each primer, and 2 U of PfuTurbo DNA 
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polymerase (Stratagene). PCR amplifications were performed on a PE9600 thermal cycler 
(Perkin Elmer) with the following parameters: initial denaturation at 95° C for 2 minutes, 35 
cycles each with denaturing at 95° C for 30 seconds, annealing at 55° C for 30 seconds, extension 
at 72° C for 2 minutes per kb ORF, and a final extension for 10 minutes at 72° C. 



Table 8. 

Summary of PCR Primers for Cloning ORF1-7 


ORFs 


Forward primers (5 1 to 3 1 ) 


Reverse primers (5* to 3 1 ) 


ORF1 (tcaA,) 


CACCATGCnTATAAGGCCTGGC 


TCAGGCCTGCACCGC 


ORF3 (tcaA 2 ) 


CACCATGGTGTCAACAACAGACAACAC 


TCAGGCTTTCGCTGCAGC 


ORF4 (tcaB 2 ) 


CACCATGACCAAGGAAGGTGATAAGC 


CTATTTCATAACATATCGAATTGG 


ORFS (tcaC) 


CACCATGCCACAATCTAGCAATGC 


TCACCGCGCAGGCGGTGAAG 


ORF6(tccC) 


CACCATGAAAATGATACCATGGACTCATC 


CTACTTTCrCrrrCATTGAAAACCGGCGG 


ORF7 
(Cryl 529) 


CACCATGAACTCAAATGAACCAAATTTATC 


AACTGGAATTAACTTCGATTC 



[00262] PCR products for each ORF were cloned into pETl 0 1 .D following instructions from the 

supplier (Invitrogen). The cloned ORF was purified as pETIO 1 .D plasmid DNA and sequenced 
verified. Since Tn analysis indicated ORF7 is the key component of SB 12 for control of the 
tested pests, biochemical analysis and insect bioassay focused on heterologously expressed ORF7 
proteins. For ORF7 expression clones, DNA sequence analysis showed 100% match with the 
original SB12 DNA sequence. Expression of ORF7 was induced by 0.5 mM IPTG for 4 hrs 
according to kit instructions (Invitrogen). 

[00263] C. Bioassay for insecticidal activities of ORF7 and tc operon . 

[00264] Bioassay samples were prepared as whole is. colt cells, cell lysates, and purified toxins. 

The spectrum and specific activity of ORF7 (Cry 1 529) is summarized in Table 10. Cryl 529 is 
most active against tobacco hornwonn (Manduca sexta) and highly active (LC50 of 10 jig/ml 
diet) against tobacco budworm {Heliothis virescens); 100% mortality was observed for both 
insects. At higher doses, Cryl 529 conferred some mortality (20 to 60%) and substantial growth 
inhibition on com earworm (Heliothis zed), beet armyworm (Spodoptera exigua), and black 
cutworm (Agrotis ipsilon). For European cornborer (Ostrinia nubilalis), Cryl 529 had some 
growth inhibition at higher doses. For some other insect species (fell armyworm, boll weevil, 
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southern rootworm, mosquito), no activity was detected. The Cry\S29 LC50s for CrylA 
(CrylAc) resistant diamond back moth (DBMr) and sensitive diamond back moth (DBM) 
colonies are >50 ng/ml and <1.0 ng/ml, respectively, suggesting a potential cross resistance. 
Cryl529 did not confer detectable activity on grass grubs, a relative of Japanese beetles. 

To test the activity of other non-Cryl 529 factors in DAS 1 529, one Cry 1 529 tn knockout 
SB12 cosmid clone (tn67) was assayed against TBW, CEW, SCRW, ECB, BW, BAW, THW, 
and grass grubs; no activity was found against these pests. To address the issue of potential non- 
or low-expression of tc ORFs in SB 12 background, individually expressed tc ORFs were tested 
independently and in combination with the other TCs from DAS 1529; no activity was found 
against TBW, CEW, and grass-grubs. Further, four ORFs were expressed as a single operon to 
very high levels in E. coli cells. When tested in vitro, the whole cells contained no detectable 
activity on TBW, CEW, and grass-grubs. While the lack of grub activity is somewhat interesting, 
these results are not surprising in that Paenibacillxis typically infect a narrow range of grub hosts. 
In light of this, it could follow that the spectrum of activity of the insecticidal toxins might also 
be relatively narrow. Thus, screens (using known methods) involving a broader range of pests, 
and additional time, would be required to identify susceptible pests. The results presented herein 
should not lead one away from recognizing that the subject TC proteins have utility as do other 
TC proteins from Xenorhabdus, Photorhabdus, and the like. 

Soluble proteins were extracted with 25 mM sodium phosphate pH 8.0, 1 00 mM sodium 
chloride and analyzed on 4-12% NuPAGE gradient gel with IX MES buffer (Invitrogen). ORF7 
protein was purified using standard procedures, and N-tenninal sequencing revealed the expected 
sequence: MNSNEPNLSDV. A bioassay was performed with whole E. coli cells, with 
normalized cell density, expressing target proteins. See Figure 6. Large scale purified ORF7 
protein was used to obtain LC50s for ORF7 by in vitro bioassay. Thermal stability analysis of the 
purified ORF7 indicated that a 5 minute treatment at 75° C is sufficient to abolish its activity 
against TBW. See Table 9. 
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Table 9. 

Thermal Stability of Purified Oyl529 (ORF7) 
Samples - Activity 



Cry\S29, room temperature 
Oyl529,50°Cfor5min. 

Oyl529, 50° C for 10 min. 
Crvl529,75°Cfor5min. 

Cry\529, 75° C for 10 mih. 

Crvl529, 100° C for 5 min. 



no and full activity, respectively 



looiei} For the tc genes, error-free clones of ORF3 and ORF6 were used as intermediate clones to 

generate a tc operon clone expressing ORF3 (tcaA), ORF4 (tcaB), ORF5 (tcaQ, and ORF6 
(tccQ. To construct the tc operon in pETlOl JD, the NsWSacI fragment containing partial tcaA, 
entire tcaB and tcaC, and partial tccC was excised out of SB12 cosmid to replace the NsWSacI 
insert in pETlOlD-tcaA; this was followed by the insertion of a 208 bp Sad fragment from 
pET101.D-tccC. See Figure 5. All four ORFS were expressed to high levels by standard IPTG 
induction. For theORF6(tccC) expressed in mete operon, the size ofthe expressed protein was 
slightly smaller than Ihe ORF6 predicted by Vector NTI from the 5 -most ATG (SEQ ID NO : 1 8) 
and expressed independently. Hence, the annotated ORF6 (SEQ ED NO:13) based on the 
presence of a ribosome binding site consensus is likely the native protein produced in SB12 and 
DAS 1529. 

D. Activity Sp ectru ms of Toxins 



[00268] 



Spectrum 


Table 10. 

activity for K coli and Pseudomonas expressed Crvl529 


Species 


Active 

(+4+) 


Format & Method 


Materia] 

Production 

Method 




H. virescens (TB W) 


+++ 


96 well top load and diet 
incorporation (scores, 
mortality, inhibition) 


FCP, SE, 
purified, IC 


llpgtox/ml diet with 
E. coli cell preps 


Kzea (CEW) 

S. exizua (BAW) 
S. frugiperda (FAW) 


+ 

+ 


96 well top load and diet 
incorporation (scores, 
mortality, inhibition) 
96 well top (score) 


FCP, SE, 
purifed, IC 

FCP, purifed 


>100pgtox/gdiet 
>78fig/cm z 


Plutellaxyhstella (DBM) 
Cry] A resistant Plutella 
xylostella (rDBM) 


+++ 
+ 


96 well top (score) 
96 well top (score) 
96 well top (score) 


FCP, purifed 
FCP, purifed 
FCP, purifed 


»10/ifi/cm 2 
0.02 MEtox/gdiet 
59.7jigtox/gdiet 
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A. ipsilon (BCW) 



O. nubilalis (ECB) 

Culex sp. (Mosquito) 

Diabrotica undecimpunctata 
howardi (SCRW) . 



96 well top (score) 



128 well top (weights) 



FCP, purifed 
FCP, purifed 



>10 /zg/cirr 



>43 ug/cm 



loz cups (mortality) 



FCP, purifed 
FCP, purifed 



>20 fig/ml H 2 0 



96 well top (score) 



Anthonomous grandis 
grandis (BW) 



128 well top (weights) 



FCP, purifed 



»100/igtox/cm 
»43 fig tox/cm 2 



M m/fl (THW) 

Continis mutabilis (Beetles); 
surrogate for grass grub 



+++ 



(hi phlv active) 
»100/igtox/gsoil 



i:^Zo, low.'moderate! high activity); FCP, frozen cel. pellets; SB, soluh.e ex tra*; purified - 
column purified Oyl529; IC, P.f. Oyl529 inclusion 

[00269) Only a limited range of pests was used in assays in an initial attempt to determine the 

activity spectrum of the subject TCs/tcORFs. The following data, using the ORF3-OR6 operon, 



Ta 

Spectrum act 


ible 11. 

ivitvfor Tc ORFs 




Species 


Active 
(+++) 


Format & 
Method 


High 
Dose 


Material 
Production 
Method 


Comments 


K virescens 
(TBW) 




96 well top 
(score) 


lOx 


FCP 


No effect 


H. zea (CEW) 




96 well top 
(score) 


lOx 


FCP 


No effect 


S. exigua 
(BAW) 




96 well top 
(score) 


lOx 


FCP 


No effect 


Spodoptera 

frugiperda 

(FAW) 




96 well top 
(score) 


lOx 


FCP 


No effect 


A. ipsilon 
(BCW) 




96 well top 


lOx 


FCP 


No effect 



[00270, Again, while this initial round of screening did not reveal activity of these TCs against 

these pests, one skilled in the art would not doubt that the subject proteins are use&l, as are the 
corresponding Photorhabdus/Xenorfobdus proteins. In addition, see Example 10, below. 

Fv am ? 1 ft 9 - Use of PCR primers f or identifying Crvl 529 homologues from other bacterial 
genera, specie s, and strains 
[0027,] For screening additional ORF7 cryl529 homologs from other (Paenibacillus or other) 

strains, gene specific and degenerate PCR primers were designed to amplify the target ORF7 
DNA sequences of 1 kb. The PCR primers were deduced from two, well-conserved protein 
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motifs (QAANLHL, domain I, block 1 core for forward primer, GPGFTGGD, domain m, block 
3 for reverse primer) highly conserved in Cry proteins. Those primers are listed in Table 12 and 
were validated on DAS 1 529. PCR amplifications were performed on a PE9600 thermal cycler 
(Perkin Elmer) with the following parameters: initial denaturation at 95° C for 2 minutes; 35 
cycles each with denaturing at 95° C for 30 seconds, annealing at 47° C for 45 seconds, extension 
at 72° C for 2 minutes, and a final extension for 10 minutes at 72° C. Those primer pairs were 
used to screen a bacterial (non-£. thuringiensis) culture collection by PCR. Five out of 192 
strains (three Paenibacillus, one Bacillus, and one unidentified) produced PCR products of 
expected sizes. These strains were also found to have CEW activity according to primary 
bioassay screening. However, sequence analysis of amplicons obtained from one of these strains 
using different primers showed that the amplicons were not derived from a cry gene. 
[00272] Notwithstanding this, and as these screens were not exhaustive, the subject invention 

includes methods of screening Paenibacillus spp., Bacillus spp. (including Bacillus thuringiensis 
and sphaericus), and the like for Cryl 529-like proteins and genes. Given the significant nature 
of the discovery of lepidopteran-toxic Cry proteins in Paenibacillus, the subject invention also 
includes methods of screening Paenibacillus spp., generally, for lepidopteran-toxic Cry proteins 
and genes. Various screening methods are well-known in the art, including PCR techniques (as 
exemplified above), probes, and feeding assays (where whole cells are fed to target pests). As 
one skilled in the art would readily recognize, screening methods of the subject invention include 
the preparation and use of clone libraries (such as cosmid libraries) in these screens. 



Table 12 

PCR Primers for Screening ORF7 Homologs 

Gene-specific and degenerate Primers DNA sequence (5 1 to 3*) 



Cryl529-F . 


CAAGCAGCCAACCTCCACCTA 


Cry 1529-R 


ATCCCCTCCTGTAAAGCCTGG 


CryPT-F 


CAAGCNGCNAATYTWCATYT 


C/yPP-R 


TCNCCN CCNGTAAANCC WGG 


CryPT-F 


CARGCSGCSAAYYTBCAYYT 


CryPP-F2 


CAAGCWGCWAATYTWCATYT 


CryPP-R2 


TCHCCWCCWGTAAAWCCWGG 


CryPT-F2 


CAGGCSGCSAAYYTGCATYT 



1529=gene specific; PP=i> popilliae codon bias; PT=P. thiaminolyticus codon bias 
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[00274] 
[00275] 



[00276] 
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F.y nm p 1e 10 - Co m plementation of Yenorhahdus Xr>tA2 TC Protein Toxin with DAS 1529 TC 
Proteins 

This example provides experimental evidence of the ability of DAS1529TC proteins, 
expressed here with a single operon (ORFs 3-6; tcaA, tcaB, TcaC and tccC; see section C of 
Example 8), to complement the XptA2 toxin &omXmorhabdusnematophilusX<m(seeSmK> 
NO:49). Two independent experiments were carried out to express the DAS1529 TC operon and 
XptA2 independently, or to co-express the XptA2 gene and the TC operon in the same E. coli 
cells. Whole cells expressing different toxins/toxin combinations were tested for activity against 
the lepidopteran insects: com earworm (Heliothis zea; CEW) and tobacco budworm {Heliothis 
virescens; TBW). The data from both experiments indicate that DAS 1 529 TC proteins are able to 
enhance Xenorliabdus TC protein XptA2 activity on both insect species tested. 
A Co-exnression of DAS1529 TCs and Xenor habdus XntA2 

Expression of the TC operon was regulated by the T7 promoter//ac operator in the 
pETlOl .D expression vector that carries a ColEl replication origin and an ampicillin resistance 
selection marker (Invitrogen). Comprehensive description of cloning and expression of the to 
operon can be found in section C of Example 8. The XptA2 gene was cloned in the pCot-3 
expression vector, which carries a chloramphenicol resistance selection marker and a replication 
origin compatible with the ColE 1 . The pCot-3 vector expression system is also regulated by the 
T7 promoter/toe operator. Hence, compatible replication origins and different selection markers 
form the basis for co-expression oftheTC operon andXptA2 in the same£. coli cells. Plasmid 
DNAs carrying the TC operon and XptA2 were transformed into E. coli, BL21 Star™ (DE3) 
either independently or in combination. Transformants were selected on LB agar plates 
containing 50 ug/ml carbenicillin forpET101.D-TC operon, 50ug/ml chloramphenicol for pCot- 
3-XptA2, and both antibiotics forpETlOl .D-TC operon/pCot-3-XptA2. To suppress basal toxin 
expression, glucose at a final concentration of 50 mM were included in both agar and liquid LB 
medium. 

For toxin production, 5ml and 50 ml of LB medium containing antibiotics and 50 mM 
glucose were inoculated with overnight cultures growing on the LB agar plates. Cultures were 
grown at 30°C on a shaker at 300 rpm. Once the culture density has reached an O.D. of ~ 0.4 at 
600 nm, IPTG at a final concentration of 75 pM was added to the culture medium to induce gene 
expression. After 24 hours, E. coli cells were harvested for protein gel analysis by the NuP AGE 
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system (Invitrogen). Cell pellets from 0.5 ml IX culture broth was resuspended in 100 pi of IX 
NuPAGE IDS sample buffer. Following brief sonication and boiling for 5 mm, 5 pi of the 
s^ple was loaded onto4to^NuPAGEbis-^ 

XptA2 expressed to detectable levels when expressed independently or in the presence of the TC 
operon. Based on gel scan analysis by a Personal Densitometer SI (Molecular Dynamics), XptA2 
expressed nearly 8X as high by itself as when co-expressed with the TC operon. For the 5 ml 
induction experiment, there is a nearly equal expression of XptA2. 
[00277] B. Bioassav for Insecticidal Activity 

[00278] A s d escribedmExample8,DAS1529tcORFswhenexpressedindependentl y orasan 
operon, did notappear to be active against TBW and CEW. The following bioassay experiments 
focused on determining whether Paenibacillus (DAS1529) TC proteins (of ORFs 3-6; TcaA-, 
TcaB-, TcaC-, and TccC-like proteins) can complement Xenorliabdus TC protein toxin activity 
(XptA2 is exemplified). Bioassay samples were prepared as whole E. coli cells in 4 X cell 
concentrate for the 5ml induction experiment, both the XptA2 and XptA2/TC operon cells 
contained very low but nearly equal amount of the XptA2 toxin. Data in Table 13 showed matat 
the 4X cell concentration tested, TC proteins + Xenorliabdus XptA2 was active against CEW. 
This provided the first evidence of a complementation effect of Paenibacillus DAS1529 TC 
proteins on Xenorhabdus XptA2. 



Table 13 Bioassay of DAS1529 TC complementation of Xeno. XptA2 on H. zea 

Insects: CEV^ 

Negative control ' I ' 

TCs(DAS1529) 
Xeno. XptA2 
TC proteins +Xeno. XptA2 



* -, ++, +++ = no, moderate and high activity, respectively ' 

[00279] For the second bioassay experiment, the amount of XptA2 protein in the XptA2 cells and 

the XptA2 + TC operon cells was normalized based on densitometer gel scan analysis. As shown 
in Table 14, Xj>tA2per jehad moderate activity at 40X on TBW (H. virescens), but the activity 
dropped to a level undetectable at and below 20X. However, when co-expressed withTCs, high 
levels of activity were very apparent in the presence of 10X and 5XXptA2, and low activity was 
still noticeable at 1.25X XptA2. These observations indicate there is a significant potentiation 
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effect of 1529 TC proteins onXenorhabdus XptA2 against K virescens. At the highest doses 
tested, neither the negative control nor the tc operon per se had any activity against this pest . 



Table 14 Bioassay of IDAS1S29 TC complementation of XptA2 






on H. virescens 






Normalized 


40X 


20X 10X 


5X 


2.5X 1.25X 


XptA2 










XptA2 
TCs +XptA2 


+ 

n.d. 


n.d. ++ 


++ 


n.d. n.d. 
+ 



* n.d. - not determined; +, ++, +++ - no, low, moderate, and high activity, respectively 



Example 1 1 - Stabilization of Crvl529 protein against trypsin digestion. 

(00280) This example teaches modifications to the DNA sequence disclosed as SEQ ID NO: 14, 

which encodes the Cry 1529 protein (disclosed as SEQ ID NO: 15) such that the new encoded 
proteins are more resistant to proteolytic digestion by trypsin than is the native protein. Digestion 
of proteins in the gut of insects limits the time of exposure of the insect to a protein toxin. 
Therefore, methods that decrease the susceptibility of a protein toxin to protease digestion can be 
used to increase potency of the protein. 

[0028I] For these tests, trypsin enzyme (e.g. Sigma Chemical #T1426) and trypsin inhibitors (e.g. 

Sigma Chemical #T9008) were prepared as stock solutions of 4 mg/mL or 1 0 mg/mL in 50 mM 
Tris HC1 buffer, pH8.0. Test incubations with various concentrations of trypsin and Cryl529 
protein were performed at 37°C for 1 hour, and were terminated by addition of an equal volume 
of an equal concentration of trypsin inhibitors (eg. a digestion that received 35 \iL of 4 mg/mL 
trypsin solution was terminated by addition of 35 jil of 4 mg/mL trypsin inhibitors). For a typical 
experiment, Cryl 529 protein was produced by appropriately engineered E. coli cells and purified 
by steps described previously, which included separation from other proteins by passage through 
a size-exclusion column. Following digestion, the protease products were analyzed by standard 
acrylamide gel electrophoresis followed by immunoblot analysis using antibody prepared against 
the Cryl 529 protein. The results of such an experiment are shown in Figure 9. 

[00282] Trypsin digestion produces two major protein products, the smaller of which is 

approximately 50 kDa in molecular size. It is noted that this digestion pattern is the same as that 
produced from trypsin digestion of a Cry 1529-His 6 protein, which is identical to the native 
Cryl529 protein amino acid sequence of SEQ ID NO:15 except for the addition of amino acids 
KGEIJSfSKLEGKPIPNPLLGLDSTRTGHHHHHH to the caiboxy-tenninus. The coding region 
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for Cryl 529-His 6 was produced by ligating the coding region for the native Cry 1 529 protein into 
the pETlOl/D-TOPO® vector (Invitrogen™, Carlsbad, CA). This recombinant clone was made to 
facilitate purification of the recombinant Cryl 529 protein by binding to a commercially available 
V5 antibody, whose epitope is represented by the amino acid sequence GKPIPNPLLGLDSTRTG 
(underlined above), or by purification schemes that expoit the six histidine residues (double 
underlined above). Procedures for these manipulations were performed according to the 
recommendations provided with the pETl 01/D-TOPO® vector. 

[00283] Trypsin digestion of the Cry 1529-His 6 protein was found to eliminate activity in insect 

bioassays against lepidopteran insects. MALDI-TOF analysis was used to determine the sequence 
of amino acids composing the N-terminus of the 50 kDa peptides, and two protease processing 
sites were determined, corresponding to amino acid residues 122 (R, Arginine) and 126 (K, 
Lysine) of SEQ ID NO:15. 

[00284] Modifications to remove the first trypsin cleavage site in the encoded protein were made 

in the native DNA sequence (SEQ ID NO: 14), using the QuickChange® mutagenesis 
methodology (Stratagene, La Jolla, CA). Three different types of mutations were made at amino 
acids in the region of 120 to 123 of SEQ ID NO:15: RARA to HANA, RARA to RARS, and 
RARA to QANA. The DNA oligonucleotide primers (listed in the 5 f to 3' direction for each 
strand) for these mutations are listed in Table 15 below. The bases that differ from the native 
DNA sequence are underlined. 



Table 15 


Mutation 


Forward (Coding strand) Primer 


Reverse (Complementary 
strand) Primer 


RARA to HANA 
(PMYC2865) 


AAAATGATTCTAATAATTTACACGCGAAC 
GCTGTAGTGAAAGAC 


GTCTTTCACTACAGCGTTCGCGTGTAAATTA 
TTAGAATCATTTT 


RARA to QANA 
(pMYC2866) 


AAAATGATTCTAATAATTTACAAGCGAAC 
GCTGTAGTGAAAGAC 


GTCTTTCACTACAGCGTTCGCTTGTAAATTA 
TTAGAATCATTTT 


RARA to RARS 
(pMYC2867) 


AAAATGATTCTAATAATTTAAGAGCGAGA 
TCTGTAGTGAAAGAC 


GTCTTTCACTACAGATCTCGCTCTTAAATTA 
TTAGAATCATTTT 



[002851 Comparison of the wild type and mutated coding regions induced by these primers are 

shown in this Table. The pertinent amino acid residues are shown in bold type. 
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Table 16 



[00237] 



Wild-type: gAA AAT GAT TCT AAT AAT TTA AGA GCG AGA GOT GTA GTG AAA GAC 
Amino Acids: (E) . S . N L ^ A ^ ^ V ^ £ 

KARA to HAN A: gAA AAT GAT TCT AAT AAT TTA CAC GCG AA_C GCT GTA GTG AAA GAC 
Amino Acids: (E)NDSNNLHANAVVKD 

KARA to QANA: gAA AAT GAT TCT AAT AAT TTA CAA GCG A£C GCT GTA GTG AAA GAC 
AMINO ACIDS: (E)NDSNNLQANAVVKD 

RARA to RARS : gAA AAT GAT TCT AAT AAT TTA AGA GCG AGA TjjT GTA GTG AAA GAC 
Amino Acids: (E)NDSNNLRA 

The separate, mutated coding regions were each cloned into the pETlOl/D-TOPO 
vector, which allows inducible production of the Cryl529 variant proteins. E. coli cells 
containing the constructs were grown, and expression oftheCryl 529 variant genes was induced 
by methods recommended by the supplier. Harvested whole cells were then tested in trypsin 
digestion assays, and analyzed as above. Typical results are shown in Figure 10. For these 
experiments,10mgof whole cell pellet was suspended in 50 mMTrisHCl,pH8.0, and digested 
for 3 hours at 37° in a final volume of 1 mL, with 100 uL of 10 mg/mL trypsin. The reactions 
were mixed occasionally during incubation. Digestion was terminated by addition of 1 00 uL of 
10 mg/mL trypsin inhibitors and the tubes were stored on ice. 

These results demonstrate that both the native Cry\ 529 (RARA) and the Cry 1 529-His 6 
(RARA) proteins are digested by trypsin to produce a major product of about 50 kDa. When the 
RARA sequence corresponding to the trypsin cleavage site was mutated to HANA or QANA, 
substantial resistance to trypsin digestion was obtained. No 50 kDa peptides were produced, and 
easily detectable amounts of the apparently full-length Cryl529-His 6 proteins were present. 
Mutation of fhe RARA site to RARS did not eliminate production of the 50 kDa peptides, but 
substantiallyreduced the rate of protease cleavage. Thus, it is apparent that mutation ofprotease 
processing sites in the C/yl529 protein substantially decreases its susceptibility to protease 
digestion. This allows the proteins to reside for longer periods of time in the insect gut following 
ingestion, resulting in increased potency to kill susceptible insects. 



Fvam ple 12 - P "si pn of PCR Primers for Det e nt, nf homologues of IDAS 1529 tcORFs i: 
other Paeni hririllwi strains 
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[00288] As shown above, Paenibacittus strain IDAS 1 529 produces an extracellular protein that is 

toxic to various Lepidopteran insects. Molecular phylogeny of the 16S ribosomal gene of this 
strain indicates that it is most closely related to members of the P. thiaminolyticus-P. 
lentimorbus-P. popilliae cluster. It has also been shown that Paenibacillus strain IDAS 1529 
contains both toxin complex genes (hereafter designated as tc genes) and a novel insecticidal 
crystalline inclusion protein gene designated cry 1529. In an attempt to determine if tc 
homologues are present in other members of the genus Paenibacillus, a collection of 
Paenibacillus strains was screened by polymerase chain reaction (PCR) and hybridization 
analyses. For the PCR analyses, total DNA isolated from Paenibacillus strains was used as 
template and screened using oligonucleotide primers specific to tc genes found in Paenibacillus 
strain IDAS 1529, Photorhabdus species, and Xenorhabdus species. Amplified products 
obtained with the tc primer sets were cloned and their nucleotide sequence was determined and 
compared to tc sequences obtained from Paenibacillus strain IDAS 1529. The following 
Examples illustrate how one can design fc-specific oligonucleotide primers and use PCR to 
search the total DNA of Paenibacillus isolates for DNA sequences that are homologous to tc 
genes identified in Paenibacillus strain IDAS 1529, Photorhabdus species, and Xenorhabdus 
species. By using PCR analysis (as described herein), it was (and is) possible to identify tc 
homologues in a species of Paenibacillus distinct from Paenibacillus strain IDAS 1 529 and the 
P. thiaminolyticus-P. lentimorbus-P. popilliae cluster. 

12 A. - Extraction of total DNA from Paenibacillus strains 

[00289] Paenibacillus strains were grown on nutrient agarplates (8 g/1 nutrient broth, 15 g/1 Bacto 

agar, Difco Laboratories, Detroit, MI) for 3-5 days at 30°C. A single colony was picked and 
inoculated into a 500 ml tribafQed flask containing 1 00 ml of sterile nutrient broth (8 g/1 nutrient 
broth; Difco Laboratories, Detroit, Ml). Following 24-72 hrs of incubation at 30°C on a rotary 
shaker at 150 rpm, the cultures were dispensed into sterile 500 ml polyethylene bottles and 
centrifuged at 6,500xg for 1 hr at 4°C. After centrifugation, the supernatant fluid was decanted 
and the bacterial cell pellet was retained. Total DNA was extracted from the cell pellet using the 
QIAGEN Genomic-tip System 100/G and associated Genomic DNA Bufifer Set (QIAGEN Inc., 
Valencia, CA, USA) by following The Sample Preparation and Lysis Protocol for Bacteria 
exactly as described by the manufacturer. The extracted total DNA was solubilized in 0.5 ml TE 
buffer (10 mM Tris-HCl, pH 8.0; 1 mM EDTA, pH 8.0). 



WO 2004/002223 



PCT7US2003/020082 



74 

12.B. - Selection of tc specific oligonucleotide primers for PCR 

To select oligonucleotide primers specific to the tc genes previously identified from 
Paetiibacillus strain IDAS 1529, the tcaA, tcaB, tcdB and tccC nucleotide sequences obtained 
from Paenibacillus strain IDAS 1 529, Photorhabdus strain W14, and Xenorhabdus strain Xwi 
were aligned using the Align program in the Vector NTI software package (Informax, Inc., 



Frederick, MD). Nucleotide sequences used for this analysis are listed in Table 17. 



Tab 


le 17. Nucleotide sequences used for tc specific primer select 


ion 


Gene 


Source organism 


Source of nucleotide 
sequence 


Gene 
Designation 


tcaAl 


Paenibacillus strain IDAS 1529 


SEOIDNO:2 


tcaAl-1529 


tcaA2 


Paenibacillus strain IDAS 1529 


SEOIDNO:6 


tcaA2-1529 


tcaA 


Photorhabdus strain W14 > 


GenBank: Accession 
No. AF346497 


tcaA-WU 


tcaBJ 


Paenibacillus strain IDAS 1529 


SEOIDNO:4 


tcaBl-1529 


tcaB2 


Paenibacillus strain IDAS 1529- 


SEQIDNO:8 


tcaB2-1529 


tcaB 


Photorhabdus strain W14 


GenBank: Accession 
No. AF346497 


tcaB-W14 


tcdBl 


Photorhabdus strain W14 


SEOIDNO:42 


tcdBl-W14 


tcdB2 


Photorhabdus strain W14 


SEOIDNO:43 


tcdB2-W14 


xptCl 


Xenorhabdus strain Xwi 


SEQIDNO:20 


xptCl-Xwi 


tcaC 


Paenibacillus strain IDAS 1529 


SEOIDNO:10 


tcaC-1529 


tccCl 


Photorhabdus strain W14 


SEOIDNO:44 


tccCl-W14 


tccC2 


Photorhabdus strain W14 


SEQIDNO:45 


tccC2-W14 


tccC3 


Photorhabdus strain W14 


SEQIDNO:46 


tccC3-W14 


tccC4 


Photorhabdus strain W14 


SEQIDNO:47 


tccC4-W14 


tccCS 


Photorhabdus strain W14 


SEQIDNO:48 


tccC5-W14 


xptBl 


Xenorhabdus strain Xwi 


SEQIDNO:21 


xptBl-Xwi 


tccC 


Paenibacillus strain IDAS 1529 


SEQIDNO:19 


tccC-1529 



12.B.L - tcaA specific primer selection 

Nucleotide sequence alignment of tcaAl-1529, tcaA2-1529, and tcaA-W14 identified two 
regions of nucleotide sequence identity of sufficient length for the selection of PCR primers with 
minimal degeneracy (shown as boxed regions in Figure 10.). These two regions were selected for 
the synthesis of tcaA specific primers, which were designated SB105 and SB106 (Tables 18 and 
19). 

12.B.ii. - tcaB specific primer selection 

Nucleotide sequence alignment of tcaBl-1529, tcaB2-1529, and tcaB-W14 identified four 
regions of nucleotide sequence identity of sufficient length for the selection of PCR primers with 
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minimal degeneracy (Figure 11.). These four regions were selected for the synthesis of tcaB 
specific primers, which were designated as SBlOl, SB102, SB 103, and SB104 (Tables 18 and 
19). 

12.B.iii. - tcaC specific primer selection 

[00293] Nucleotide sequence alignment of tcdBl-W14 t tcdB2-W14, xptCl-Xwi and tcaC-1529 

identified two regions of nucleotide sequence identity of sufficient length for the selection of 
PCR primers with minimal degeneracy (Figure 12.). These two regions were selected for the 
synthesis of tcaC specific primers, which were designated as SB215 and SB217 (Tables 18 and 
19). 

12.B.iv. - tccC specific primer selection 

[00294] Nucleotide sequence alignment of tccCl-W14, tccC2-W14 t tccC3-W14, tccC4-W14, 

tccC5-W14, xptBl-Xwi and tccC-1529 identified two regions of nucleotide sequence identity of 
sufficient length for the selection of PCR primers with minimal degeneracy (Figure 13.). These 
two regions were selected for the synthesis of tccC specific primers, which were designated as 
SB212 and SB213 (Tables 18 and 19). 



Table 18. tc specific primers 


Primer 
designation 


Primer 
length 


Sequence of primer (5' to 3') 


SEQBD 
NO. 


SBlOl 


32 


GCKATGGCSGACCCGATGCAWTACAAGCTGGC* 


22 


SB102 


32 


AGCGGYTGACCRTCCAGRCTCARATTGTGGCG 


23 


SB103 


28 


TGTATAACTGGATGGCYGGWCGTCTSTC 


24 


SB104 


26 


TCRAAAGGCAGRAAMCGGCTGTCGTT 


25 


SB105 


28 


CTTCY CTKGATATCYTKYTGGATGTGCT 


26 


SB106 


30 


ACGRCTGGYATTGGYAATCAGCCARTCCAA 


27 


SB212 


27 


CGYTATIAATATGAYCCKGTVGGYAAT 


28 


SB213 


25 


CATCBCGYTCTTTRCCIGARTARCG 


29 


SB215 


33 


CGHAGCTCYICCCAGTWYTGGCTGGATGARAAA 


30 


SB217 


32 


GTRTCATTTTCATCTTCRTTBACIRYAAACCA 


31 


*K=GorT;S=GorC;W= 
or C or T, 1= inosine 


A or T; Y= C or T; R= G or A; M= A or C; V= G or C or A; B= G or T or C; H= A 


Table 19. tc primer combinations 


Target 
gene 


Forward 
primer 


Reverse Approximate size of expected amplified 
primer product 


tcaA 
tcaB 
tcaB 


SB105 
SBlOl 
SB 103 


SB106 1.4 kb 
SB102 0:4 kb 
SB104 0.65 kb 
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tcdB SB215 SB217 2.2kb 

tccC SB212 SB213 2£J* 

Sample 13 - P^R amplifica tion of PaenibacMus DNA 

For PCR amplification using tcaA- and tazB-specific primer sets, 3-5 ul of total DNA 
obtained ftom each of thePa«ii6fl«flt« strains wasmixedwifoSOpmolesofeachprimerandlX 
Eppendorf MasterMix (Eppendorf AG; Hamburg, Germany) in a 20 ul reaction volume. 
Amplification conditions were denaturation at 94 6 C for 3 minutes followed by 30 cycles of 
denaturation at 94°C for 1 minute, annealing at 52°C for 1 .5 minutes, and extension at 72°C for 
1.5 minutes, followed by a final extension at 72°C for 5 minutes. 

For PCR amplification using tcaC- and fccC-specific primer sets, approximately 375 ng 
of total DNA obtained from each of the Paenibacillus strains was mixed with 50 pmoles of each 
primer and 12.5 ul of Epicentre* FailSafe™ Buffer D and 2.5 U of Epicentre® FailSafe™ 
Polymerase (Epicentre; Madison, WI)ina25ul reaction volume. Amphfication conditions were 
denaturation at96°Cfor4minutes followed by 40 cycles of denaturation at 94°C for 30 seconds, 
annealing at 64°C for 30 seconds, and extension at 70°C for 30 seconds. Each cycle, the 
annealmgtemperaturewasloweredbyO.5^ 

13.A. - Gel electrophoresis, cloning, and nucleotide sequence determination of PCR 

amplified products 

PCR amphfication reactions were examined by gel electrophoresis using 0.8 to 1% 
Seakem LE agarose (BioWhittaker Molecular Applications, Rockland, ME) in IX TAE buffer. 
Ampimedproductswereclonedm^^^ 

(Invitrogen™ Life Technologies, Carlsbad, CA) exactly as described by the manufacturer. The 
nucleotide sequences of the cloned amplified products were determined using M13 Forward, 
Ml 3 Reverse, and tc sequence-specific sequencing primers as needed to obtain double stranded 
sequenceofeachclonedamplifiedproduct Nucleotide sequencing was performed using IheCEQ 
I^eTeminator Cycle SequencmgQ^^ 

the CEQ 2000 XL DNA Analysis System (Beckman Coulter) exactly as directed by the 
manufacturer. The Sequencher (v. 4. 1 .4) software package (Gene Codes, Ann Arbor, MI) was 
used to construct contigs from the sequencing data and determine a consensus sequence for each 
amplified product. 
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13.B. - Nucleotide sequence analysis of PCR amplified products 
13.BA.-tcaA 

[00298] When PCR using the tcaA- (primer combination SB 1 05 and SB 1 06) was performed using 

total DNA obtained from the collection of Paenibacillus strains, it was observed that total DNA 
from a Paenibacillus apiarius strain (NRRL NRS 1438; hereafter designated as DB482) 
produced an amplified product of the expected sizes. The amplified product was cloned and 
. sequenced. 

[00299] The amplified product obtained using the SB 105 and SB 106 primer combination was 

designated as tcaA2-DB482. When the sequence of tcaA2~DB482 (SEQ ID NO:32) as compared 
. to the tcaA sequences obtained from Paenibacillus strain IDAS 1529 and Photorhabdits strain 
W14, it was observed that tcaA2-DB482 have the greatest nucleotide sequence identity (90.5% 
over 1 ,239 nucleotides) to tcaA2-1529 (Table 20). The deduced amino acid sequence encoded by 
tcaA2-DB482 (designated as TcaA2-DB482; SEQ ID NO:33) was 89.1% identical to the 
corresponding deduced amino acid sequence of tcaA2-1529 (designated as TcaA2-l 529; SEQ ID 
NO:7). 

Table 20. Nucleotide and deduced amino acid sequence identity of tcaA2-DB482 with 

corresponding regions of tcaAl-1529, tcaA2-1529 9 and tcaA-W14 

% deduced amino acid 
% Nucleotide identity with sequence identity with 

Gene tcaA2-DB482 tcaA2-DB482 

tcaAl-1529 57 33 
tcaA2-1529 90 89 
tcaA-W14 50 32 

13JB.ii.-tcaB 

[00300) The amplified products obtained using the SB 1 0 1 and SB 1 02 primer combination and the 

SB 103 and SB 104 primer combination were designated as tcaB2a-DB482 and tcaB2b-DB482, 
respectively. When the sequences of tcaB2a-DB482 (SEQ EDNO:34) and tcaB2b-DB482 (SEQ 
ID NO: 35) were compared to the tcaB sequences obtained from Paenibacillus strain IDAS 1 529 
and Photorhabdus strain W14, it was observed that both of these sequences have the greatest 
nucleotide sequence identity to tcaBl-1529 and tcaB2-1529 (Table 21). The nucleotide 
sequence identity of tcaB2a-DB482 and tcaB2b-DB482 to tcaB2-1529 was 92.6% and 89.8%, 
respectively. The deduced amino acid sequences encoded by tcaB2a-DB482 (designated as 
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TcaB2a-DB482; SEQ ID NO:36) tcaB2b-DB482 (designated as TcaB2b-DB482; SEQ IDNO:37) 
were 91 .2% and 91.1% identical, respectively, to the corresponding deduced amino acid sequence 
oitcaB2-1529 (designated as TcaB2-l 529; SEQ ID NO:9). 

Table 21. Nucleotide and deduced amino acid sequence identity of tcaB2a-DB482 and tcaB2b- 

DB482 with corresponding regions of tcaBl-1529, tcaB2-1529, and tcaB-W14 , 

% Nucleotide % Nucleotide % deduced amino % deduced amino 
identity with identity with acid sequence with acid sequence with 
Gene tcaB2a-DB482 tcaB2b-DB482 TcaB2a-DB482 T caB2b-DB482 
tcaBl-1529 93 ~~ 93 94 92 
tcaB2-1529 93 90 91 92 
tcaB-W14 63 57 59 57 

13.B.iii. - tcdB 

[moi] When PCR using the taiC-specific primer combination (SB215 and SB217) was 

performed using total DNA obtained from DB482 produced an amplified product of the expected 
size. The amplified product was cloned and sequenced. 

loom) The amplified product obtained using the SB215 and SB217 primer combination was 
designated as tcaC-DB482. When the sequence of tcaC-DB482 (SEQ ED NO:38) was compared 
to the tcaC sequences obtained from Paenibacillus strain IDAS 1529, Xenorhabdus strain Xwi 
and Photorhabdus strain W14, it was observed that tcaC-DB482 has the greatest nucleotide 
sequence identity (93.5% over 2,091 nucleotides) to tcaC-1529 (Table 22). The deduced amino 
acid sequence encoded by tcaC-DB482 (designated as TcaC-DB482; SEQ ID NO:39) was 91 . 1% 
identical to the corresponding deduced amino acid sequence oitcaC-1529 (designated as TcaC- 
1529; SEQ ID NO:ll). 

Table 22. Nucleotide and deduced amino acid sequence identity of tcaC-DB482 
corresponding regions of xptCl-Xwi, tcdBl-W14, and tcdB2-W14, and tcaC-1529 

% Nucleotide sequence % deduced amino acid sequence 

Gene identity with tcaC-DB482 identity with TcaC-DB482 

tcaC-1529 93 91 
xptCl-Xwi 50 35 
tcdBl-W14 50 36 
tcdB2-W14 50 36 



13.B.iv.-tccC 
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When PCR using the fccC-specific primer combination (SB212 and SB212) was 
performed using total DNA obtained from the collection of Paenibacillus strains, it was observed 
that total DNA from DB482 produced an amplified product of the expected size. The amplified 
product was cloned and sequenced. 

The amplified product obtained using the SB212 and SB213 primer combination was 
designated as tccC-D£482. When the sequence of tccC-DB482 (SEQ ID NO:40) was compared 
to the tccC sequences obtained from Paenibacillus strain IDAS 1 529, Xenorhabdus strain Xwi 
and Photorhabdus strain W14, it was observed that tccC-DB482 has the greatest nucleotide 
sequence identity (93.7% over 858 nucleotides) to tccC-1529 (Table 23). The deduced amino 
acid sequence encoded by tccC-DB482 (designated as TccC-DB482; SEQBDNO:41) was 95.5% 
identical to the corresponding deduced amino acid sequence of tccC-1 529 (designated as TccC- 
1529; SEQIDNO:13). 



Table 23. Nucleotide and deduced amino acid sequence identity of tccC-DB482 
corresponding regions of xptBl -Xwi, tc-W14, tccC-1529, and tec genes from Photorhabdus 
strain W14 . 



Gene 


% Nucleotide sequence 
identity with tccC-DB482 


% deduced amino acid sequence 
identity with TccC-DB482 


tccC-1529 


94 


96 


xptBl-Xwi 


54 


45 


tccCl-W14 


54 


48 


tccC2-W14 


56 


45 


tccC3-W14 


56 


46 


tccC4-W14 


56 


46 


tccC5-W14 


54 


44 



13.C. - Summary of PCR analyses 

[00305] This example (and other examples herein) illustrate methods for designing 

oligonucleotide primers based on tc genes from three genera of bacteria, and that the use of these 
primers for PCR screening of Paenibacillus strains can identify tc homologues present in those 
strains. DB482, which is an isolate of Paenibacillus apiarius (deposited as NRRL B-30670) that 
was isolated from honey bee larva, was shown to contain homologues of tcaA, tcaB 9 tcaC, and 
tccC. The finding of these tc homologues confirms that Paenibacillus strain IDAS 1 529 is not 
unique within the genus Paenibacillus with regard to possessing tc genes. Therefore, one skilled 
in the art can now use these and other methods to identify other tc homologues in other species of 
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[00306] 



[00307] 



[00308] 



PaenibaciUus such as P. chondroitinus, P. alginolyticus, P. larvae, P. validus, P. gordonae, P. 
aWei, P. lentimorbus, P. popilliae, P. thiaminolyticus, P. curdlanolyticus, P. kobensis, P. 
glucanolyticus, P. lautus, P. chibensis, P. macquariensis, P. azotofixans, P. peoriae, P. 
polymyxa,P. illinoisensis, P. amylolyticus, P, pabuli, and P. macerans. 

i^ pl, 14- Detect*™ nf hnmolooues o f ™as toORFS in other PamihacUlus stoAns 

bv Southern Hybridization 

This example illustrates how one can use radioactively labeled DNA fragments as probes 
to search the genomic DNA of PaenibaciUus isolates for DNA sequences (preferably having 
somehomologytotheknowntcORFsfirstdetectedinlDAS 1529). The results demonstrate mat 
sequences homologous to two of the tcORFs are detected in a PaenibaciUus apairius isolate, 
DB482. 

Genomic DNA from various PaenibaciUus strains (or fromi?. coli to serve as anegative 
control) was prepared as described above in Example 12, and was digested with restriction 
enzyme to produce multiple fragments. A typical digestion contained 8 ug of DNA in a total 
volumeof400uLofreacnonbu^ 

England Biolabs, Beverly, MA). The reaction, containing 200 unite of enzyme, was incubated 
overnight at 37»C, then placed on ice. Digested DNA was former purified and concentrated by 
addition of 30 uL of 3M sodium acetate (pH5.2) and 750 uL of ice cold 100% ethanol, followed 
by centrifogation. The DNA pellet was washed twice with 70% ethanol, dried under vacuum, 
andresuspendedmSOulofTEbuff^tlOn^ 

acid (EDTA)]. An aliquotwas then analyzedby agarose gel electrophoresis for visual assurance 
of limit digestion. In a similar manner, DNA of IDAS 1529 cosmid SB12 was digested with 
EcoK I, and was used as a positive control for the hybridization experiments. 

EcoK I digested genomic DNA fragments to be blotted for Southern analysis were 
separated by electrophoresis through 0.7% or 1 .2 % agarose gels in TEA buffer (40mM Tris- 
acetate, 2mM EDTA, pH8.0) (1 u g DNA/well). On each gel, lanes containing a lkb DNA 
Molecular Weight Ladder (Invitrogen™, Carlsbad, CA) were used to provide molecular weight 
size standards. The 15 fragment sizes larger than 500 bp in this ladder (in kilobases) are: 12.2, 
11.2,10.1,9.2, 8.1, 7.1,6.1,5.1,4.1,3.1,2.0,1.6, 1.0, 0.52, and 0.50. The DNA in the gel was 
stained with 50 jig/mLethidium bromide, Ihe gel was photographed, andthentheDNAinthegel 
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was depurinated (5 min in 0.2M HC1), denatured (15 min in 0.5M NaOH, 1.5M NaCl), 
neutralized (5 min in 0.2M HC1) and transferred to MAGNA 0.45 micron nylon transfer 
membrane (Osmonics, Westborough, MA) in 2X SSC (20X SSC contains 3M NaCl, 0.3M 
sodium citrate, pH 7.0). The DNA was crosslinked to the membrane by ultraviolet light 
(Stratalinker®; Stratagene, La Jolla, CA) and prepared for hybridization by incubating at 60°C or 
65°C for 1 to 3 hours in "Minimal Hybridization" solution [contains 10% w/v polyethylene glycol 
(M.W. approx. 8000), 7% w/v sodium dodecylsulfate; 0.6X SSC, 5mM EDTA, 100 jig/ml 
denatured salmon sperm DNA, and 1 OmM sodium phosphate buffer (from a 1M stock containing 
95 g/LNaH 2 P0 4 * 1H 2 0 and 84.5 g/LNa 2 HP0 4 7H 2 0)]. 

DNA fragments of the tcORFs for use as hybridization probes were first prepared by 
Polymerase Chain Reaction (PCR) using SB 12 cosmid DNA as template (see previous 
examples). The forward and reverse primers for these amplifications are listed (5 1 to 3* directions 
of the respective DNA strands) in Table 24, below (bases in capital letters correspond to protein 
coding regions). Primer Set One is designed to amplify, from SB12 cosmid DNA, a DNA 
fragment that includes all of tcORF5, which is disclosed as SEQ ID NO: 10, and which has some 
similarity to the Photorhabdus tcaC gene (Table 6). Primer Set Two is designed to amplify, from 
cosmid SB12, a DNA fragment that encodes the protein disclosed as SEQ ID NO: 19. This DNA 
fragment and the encoded protein are somewhat longer than the DNA sequence of tcORF6 
disclosed as SEQ ID NO: 12, and the encoded protein disclosed as SEQ ID NO: 1 3. The proteins 
disclosed as SEQ ID NO: 1 3 and SEQ ID NO: 1 9 both have some similarity to the protein encoded 
by the Photorhabdus tccC gene (Table 6). The amplified PCR products were cloned into the 
pCR®2.1-TOPO® cloning vector (Invitrogen™, Carlsbad, CA), and fragments containing the 
tcORFs were released from the resulting clones by restriction enzyme digestion (listed in the 
Table below), followed by purification from agarose gels using the GenElute™ Agarose Spin 
columns (Sigma Chemical Co, St Louis, MO). Recovered fragments were concentrated by 
precipitation using the Quick-Precip™ Plus Solution according to the supplier's instructions 
(Edge BioSystems, Gaithersburg, MD). 



Table 24. 

PCR Primer Set One 

SB12 tcORF5 (SEQ ID No, 10) 

Forward Primer SB126* 
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gtacgtcalctagaaaggagatataccATGCCACAATCTAGCAATGCCGATATCAAGCTATTGTC 

^i^Sc^AGGC.K^GCAAATAA^TOAGTOATOG^ 
St tompCR*2.1 -TOP O" clone with So/ 1 + Xba I + Pvu I and punfy 4,368 bp ftagment 



[00310] 



S^^^GAAAATGATACCgTCMACTCAcCATrATTTGCTTCACC 
^SgS^CTCnCATTGAAAACCGGD^ 



might interfere with subsequent functional analysis of the clone. 

RadioactivelylabeledDNAfra^^ 
LabelingKit (Roche Diagnostics, Mannheim, Gemany) according to the supplier's iBStructions. 
Nonincorporated nucleotides were removed by passage through a QIAquick»PCR Purification 
column (Qiagen, Inc. Valencia, CA) according to the manufacturer's instructions. Labehng of 
approximately 100 ng of DNA fragments by these methods resulted in specific activities of 
approximatelyO.l uCi/ng. The labeled DNA fragments were denatured by boiling for 5 minutes, 
then added to the hybridization blot m Minimal Hybridization solution and m^^^ 
60°Cor65°C. Loose radioactivity was removed from the blot by rinsing at room temperature in 
2X SSC, then more tightly bound radioactivity was removed by washing the blot for at least one 
hour at 60°C or 65° C in 0.3X SSC + 0.1% sodium dodecylsulfate. At least two such washes 
were performed. The blot was placed on X-ray film at -80»C with two intensifying screens, and 
the exposed film was developed after 1 to 3 days exposure. Blots were stripped of hybridized 
DNA fragments by boiling for 10 minutes in 0.3 X SSC + 0.1% SDS, and reused once or twice 

for subsequent hybridizations. 
[mn] KstmctfragmentsmathybridizedtoprobesderivedfromPrimerSetsO^^ 

observed in genomic DNA obtained from Paenibacillus apairius strain DB482. The probe 
derived from Primer Set One (primers SB126 and SB127), which detects sequences homologous 
tothelDAS 1529tcORF5,hybridized to fragments of estimated sizes (inkilobases)of20, 10.2, 
and 8.4. Within this range of molecule sizes, mobilities of DNA fragments can provide only 
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estimations of true molecular sizes. Signal intensity for the fragments estimated to be 20 kb and 
8.4 kb were much more intense than the signal intensity for the fragment estimated to be 1 0.2 kb. 
Since each of these fragments is at least twice the size of the probe fragment (about 4.4 kb), one 
explanation for these results is that multiple copies of genes that are similar to the probe derived 
from IDAS 1 529 tc ORF5, and thus are similar to the Photorhabdus tcaC gene, are present in the 
genome of Paenibacillus apairiits strain DB482. However, other explanations for this outcome 
are possible. 

[00312] The probe derived from Primer Set Two (primers SB 128 and SB129), which detects 

sequences homologous to the IDAS 1 529 tcORF6 and its flanking 5' end sequences, hybridized to 
fragments of estimated sizes (in kilobases) of 7.8 and 4.5. Signal intensity for the fragment 
estimated to be 7.8 kb was very much more intense than the signal intensity seen for the fragment 
estimated to be 4.5 kb. One explanation for this result is that Paenibacillus apairius strain 
DB482 has a single gene similar to the IDAS 1 529 tcORF6 and its 5' flanking sequences, and 
thus is similar to the Photorhabdus tccC gene, and that EcoR I cleaves the gene into two 
fragments that have unequal portions of the DNA sequences comprising the gene. However, 
other explanations for this outcome are possible, including the presence of multiple genes with 
different amounts of absolute homology to the probe. 

[00313] These results (detection by PCR amplification followed by DNA sequence analyses) 

confirm the presence of relatives of the Photorhabdus tcaC and tccC genes in Paenibacillus 
apairius strain DB482. 

Example 15 - Insecticidal activity of DB482 
[003H] Paenibacillus strain DAS 1 529 has been shown to produce an extracellular protein that is 

toxic to Lepidopteran insects and has also been shown to contain a cry gene, designated as 
cry 1529, As this strain produces an extracellular insecticidally active protein and intracellular 
insecticidally active proteins, the subject invention includes screening other strains of 
Paenibacillus for extracellular (released into culture supernatant fluid) and/or intacellular (cell- 
associated) insecticidally active agents. This example illustrates how one can produce 
fermentation broths of Paenibacillus strains, how to process these broths, and how to test 
samples derived from these broths for insecticidal activity. 
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00315] 



[003161 



15 A Production and processing of Pacubacillus fermentation broths 

Paenibacillus strains were grown on nutrient agar plates (8 g/lnutrientbroth, 15 g/lBacto 
agar- Difco Laboratories, Detroit, MI) for 3-5 days at tfC. A single colony was picked and 
mo culatedintoa500 ml tnbaffled flask containing 100 ml of sterile modified tryptic soy broth 
(tryptone 10- gA, peptone 7 gA, soytone 3 gA, KC1 5 gA, K 2 P0 4 2.5 g/1; Difco Laboratory 
Detroit, MI). Following 72 hours of incubation at 28»C on a rotary shaker at 150 rpm, tine 
culturesweredspensed^^ 

minutes at4°C. After centrifugation, the supernatant fluid was decanted and filtered through a 
0 22 urn membrane filter (Millipore Corporation, Bedford, MA). The culture filtrate was then 
concentrated 20X using a Cenbicon Plus-20 cenbimgal filter device with a 5,000 molecular 
weightcutoffmembranebycentrifugmgat4,000xg.Thebacteri^ 

10mMpotassiumphospbatebuffer(pH=8). These samples were then tested in insect bioassay 
for insecticidal activities contamed m me processed supernatant and cell pellet sanmle^ 

15.B. Insect bioassay of processed supernatant and cell pellets 

The insect species included in these assays were Diabrotica undecimpunctata hovardi 
(Sou*emco m rootworm,SC^ 

(tob acco budworm, TBW) The artificial diet used to rear and bioassay SCR was described 
previously (Rose, RL. and McCabe, JM. 1973. , Econ. Entomol. 66, 398-400). Standard 
artificial lepidopteran diet (Stoneville Yellow diet) was used to rear and bioassay ECB, CEW 
andTBW Forty ul aliquots of the concentrated supernatant or cell pellet samples were appbed 
directly to the surface of wells (~1 .5 cm 2 ) containing the artificial diet. Treated diet wells were 

hatched from surface-sterilized egg, Assay trays were then sealed, placed in a humimfied 

^wfcchamber.andmam^^ 

were then scored. Eight insects were used per treatment 



[00317] 



15.C. Insecticidal activity of MM** mg2 ^ ^^4,1 activity 

Concentrated supernatant and cell pellets from stiara uiwi « 

_*« SCR, TBW, and.CEW relative «, control treatments (Table 25.) It is possible Use. the 
taedciW activity associated with concerted supernal and cell pdta ftotn DB482 are 
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the result of two different insecticidal factors, one that is cell-associated (£.& Cry-like) and 
another that is released from the cells (z.e. TC-iike). However, it is also possible that the 
insecticidal activities from both the concentrated supernatant and cell pellets from DB482 are the 
result of the same insecticidal factors being present in both cellular fractions. 



Table 25. Insecticidal activity of DB482 



Insects Tested 


Concentrated Supernatant activity 


Cell pellet activity 


SCR 


+++* 


t l 1 


raw 


++ 


++ 


CEW 


+++ 


-H- 


Medium controls 







* ++, +++; no, moderate, and high activity, respectively 



15.D. Summary of Insecticidal Activity Screening 
[00318] This example illustrates a method for screening concentrated culture supernatants and cell 

pellets from Paetiibacillus strains to identify strains possessing insecticidal activity against 

Coleopteran and Lepidopteran insects. DB482, which is an isolate of Paenibacillus apiaritis was 

shown herein to contain homologues of tcaA, tcaB, tcaG, and tccC. The finding of insecticidal 

activity in DB482 confirms that Paenibacillus strain DAS1529 is not unique within the genus 

Paenibacillus with regard to producing insecticidal activities against Lepidopteran insects. 

Therefore, the subject invention includes methods used to identify other strains of Paenibacillus 

with insecticidal activities against Lepidopteran insects in other species of Paenibacillus such as 

P. chondroitinus,P. alginolyticus, P. larvae, P. validus,P. gordonae,P. alvei 9 P, lentimorbus, P. 

popilliae, P. thiaminolyticus, P. curdlanolyticus, P. kobensis, P. glucanolyticus, P. lautus, P. 

chibensis, P. macquariensis, P. azotofixans, P. peoriae, P. polymyxa, P. illinoisensis, P. 

amylolyticus, P. pabuli, P. macerans. 
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Claims 

A method of screening a culture of a Paeiiibacillus isolate for a gene that encodes a 
protein selected from the group consisting of a Cry protein that is toxic to a 
lepidopteran pest and a toxin complex protein, wherein said method comprises at 
least one ofthe following steps: 

(a) obtaining DNA from said culture and assaying said DNA for the presence 
of said gene; and 

(b) obtaining protein produced by said culture and assaying said protein for 
the presence of a protein that indicates the presence of said gene in said 
isolate. 

2. AmemodofscreenmgaculmreofaPa^Wtoisolateforaproteinm^^ 
activity againstalepidopteranpestwherein said method comprises at least one ofthe 
following steps: 

(a) obtaining culture broth produced by said culture and assaying said broth 
for toxin activity against a lepidopteran pest; and 

(b) feeding a plurality of said isolates to a lepidopteran pest and observing 
said pest for effects of a toxin. 

3. The method of claim 1 wherein said method comprises screening a collection of 
Paenibacillus isolates for said protein, and said isolate is in said collection. 

4. The method of claim 1 wherein said protein is a toxin complex protein. 



5. 



Themethod of claim 4 wherein said protein enhances Ihe activity of a toxin complex 
toxin protein. 



6. The method of claim 1 wherein said protein is a Cry protein that is toxic to a 
lepidopteranpest. 
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7. The method of claim 1 wherein said step of obtaining DNA from said culture 
comprises creating a library of clones from said DNA and assaying at least one of 
said clones for the presence of said gene. 

8. The method of claim 7 wherein said step of assaying said clone for the presence of 
said polynucleotide comprises assaying said clone for lepidopteran toxin activity, 
thereby indicating the presence of said polynucleotide. 

9. The method of claim 1 wherein said step of assaying said DNA comprises performing 
polymerase chain reaction with at least one primer that is designed to indicate the 
presence of said gene. 

10. The method of claim 1 wherein said step of assaying said protein comprises 
immunoreacting an antibody with said protein wherein said antibody is designed to 
indicate the presence of said protein. 

1 1 . The method of claim 1 wherein said step of assaying said DNA comprises hybridizing 
a nucleic acid probe to said DNA wherein said probe is designed to indicate the 
presence of said gene. 

12. An isolated protein that has toxin activity against an insect pest wherein said protein 
is encoded by a polynucleotide sequence that hybridizes with the complement of a 
sequence selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 
32, 34, 35, 38, and 40. 

1 3 . The protein of claim 12 wherein said protein is a Cry protein and said probe is the 
complement of SEQ ID NO: 14. 

14. The protein of claim 12 wherein said protein is a toxin complex protein and said 
probe is the complement of a sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 32, 34, 35, 38, and 40. 



s 
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15. An immunoreactive fragment of a protein according to claim 12. 

16. An isolated polynucleotide that encodes a protein according to claim 12. 

17. A cell comprising a polynucleotide according to claim 1 6. 

18. The cell according to claim 1 7 wherein said cell is selected from the group consisting 
of a plant cell and a microbial cell. 

19. A method of controlling an insect pest wherein said method comprises the step of 
contacting said pest with a protein comprising an amino acid sequence selected from 
the group consisting of SEQ BDNOS: 3, 5, 7, 9, 11, 13, 15, 18, 19, 33, 36, 37, 39, and 
41. 

20. The method of claim 1 wherein said Paenibacillus isolate is of a species selected 
from the group consisting of P. apiarius, P. chondroitinus, P. alginolyticus, P. larvae, P. 
validus, P. gordonae, P. alvei, P. lentimorbus, P. popilliae, P. thiaminolyticus, P. 
curdlanolyticus, P. kobensis, P. glucanolyticus, P. lautus, P. chibensis, P. macqiiariensis, P. 
azotofixans, P. peoriae, P. polymyxa, P. illinoisensis, P. amylolyticus, P. pabuli, and P. 
macerans. 

21 . The method of claim 1 1 wherein said probe is derived from a gene selected from the 
group consisting of tcaA, tcaB, tcaC, tcbA, tccA, tccB, tccC, tcdA, tcdB, xptAl, 
xptDl, xptBl, xptCl, xptA2, sepA, sepB, and sepC. 

22. The method of claim 9 wherein said primer is derived from a gene selected from the 
group consisting of tcaA, tcaB, tcaC, tcbA, tccA, tccB, tccC, tcdA, tcdB, xptAl, 
xptDl, xptBl, xptCl, xptA2, sepA, sepB, and sepC. 

23 . The method of claim 9 wherein said primer is selected from the group consisting of 
SEQ ID NOS: 22, 23, 24, 25, 26, 27, 28, 29, 30, and 31. 
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24. A biologically pure culture of a Paenibacillus strain selected from the group 
consisting of DAS 1529 (available under NRRL B-30599) and DB482 (available 
under NRRL B-30670). 
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2THIA 
or 

AAC44156: DITLKVAIYPYVPDPARFQA 
I I I I I I I I I I I I I I I III! 
SEQIDNO:17: DITLKVAIYPYVPDPSRFQA 



Fig. 3 
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SEQ ID NO: 9 {top) x SEQ ID NO: 5 (bottom) 

1 MTKEGDKHMSTSTLLQSIKEARRDALVNHYIANQVPTALADKITDADSLY 50 

I I I I I : I : I I I I I I I I III llllllllllll II I Ml H 

1 MTKEGGKNMSTSTLLQLIKESRRDALVNHYIANNVPRELTDKITDADSLY 50 

* * * 

51 EYLLLDTKISELVKTSPIAEAISSVQLYMNRCVEGYEGKLTPESNTHFGP 100 

| 1 | | | | I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I ' I MM 
51 EYLLLDTKISELVKTSPIAEAISSVQLYMNRCVEGYEGKLTPEGNSHFGP 100 

101 GKFLYNWDTYNECRFSTWAGKERLKYYAGSYIEPSLRYNKTDPFLNLEQSI 150 

lllllllllllllllll M Ml M 111111111:1 
101 GKFLNNWDTYNECRYSTWAGKERLBCYYAGSYIDPSLRYNKTDPFLNLEQNI 150 

151 SQGRITDDTVKNALQHYLTEYEVLADLDYISVNKGGDESVLLFVGRTKTV 200 

I I 1 I I 1 I I M 1 I 1 1 I 1 1 1 1 1 1 1 M 1 M MIMM Mill lllllll: 

151 SQGRITDDTVKNALQHYLTEYEVLADLEYISVNKGADESVLFFVGRTKTM 200 

201 PYEYYWRRLLLKRDNNNKLVPAVWSQWKKISANIGEIAVDSYVVPRWHKNR 250 

I 1 | 1 1 1 I I J I I r 1 I I I I 1 I I I lllllll IIMIIhMII :ll:ll 
201 PYEYYWRRLTLKKDNNNKLVPAIWSQWKKITANIGEAVNNYWLHWHNNR 250 

251 LHVQWCSIEKSENDAGEPIEKRYLNDWFMDSSGVWSSFRKIPWEKSFEY 300 

Mill I II M M I I I I I I I I I I M II M M I II II I : Mil I 
251 LHVQWGSTEKTQNDDGEPIEKRYLNDWFMDKSSVWSSFRKVSYIENSFTY 300 

301 LDGSLDPRFVALVRNQILIDEPEIFRITVSAPNPIDANGRVEVHFEENYA 350 

III : : I I I | : | | | | : : : : 

301 TEGIIDSRNITIAGNQLFCDDSNTFKATITA. LPFDQIRVYLEKIYGTGG 349 

351 NRYNITIKYGTTSLAIPAGQVGHPNISINETLRVEFGTRPDWYYTFRYLG 400 

: :: ::|: : : I: : I I : : 

350 SiTVTGENKGYIIKVGEPREVSFSPNTLLDVFIGSNASPRDPYFKATFNR 399 

401 NTIQNSYGSIVNNQFSPPSGSNIKGPIDLTLKNNIDLSALLDESLDALFD 450 

lllllll: II I I I I I I I I I I I I I II I I I I I I I M I Mlhlll 
400 EALQNSYGSIKINQYTPPSGSNIKGPIDLTLKNNIDLSALLEESLDVLFD 449 

451 YTIQGDNQLGGLAAFNGPYGLYLWEIFFHVPFLMAVRFHTE — QRYELAERW 500 

llllhllllll M 1 I I I 1 1 I M I II I II I I I M I II M I I I I I I I II 
450 YTIQGNNQLGGLEAFNGPYGLYLWEI FLHVPFLMAVRFHTE ] [QRYELAERW 499 

501 FKFIFNSAGYRDDYGSLLTDDKGNVRYWNVIPLQEDTEWDDTLSLATTDP 550 

I I I 1 I 1 I I I I I I : I I : I II 11 1 I M 1 I I I I I I I 1 I 1 I I I 1 I 1 1 I I ■ I I ■ 

500 FKFIFNSAGYRDGYGNLLTDDKGNVRYWNWPLQEDTEWDDTLSIiATTDP 549 

551 DE I AMAD PMQ YKLAI FI HTMDFL I S RGDSL YRMLERDT LAEAKMY YI QAS 600 

II II II II M M M Ml II I 1 1 1 1 I I II I I 1 1 I M M I MIMIMII 

550 DEIAMADPMQYKLAIFIHTLDFLISRGDSLYRMLERDTLTEAKMYYIQAS 599 

601 QLLGPRPDIRLNHSWPNPTLQSEADAVTAVPTRSDSPAAPILALRALLTG 650 

I MUM II IMMMI IN llllllllllll I Ml II III I I MM 
600 QLLGPRPEIRINHSWPDPTLQSEADAVTAVPTRSDSPAAPILALRALLNA 649 



Fig. 7A 
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651 ENGHFLPPYNDELFAFWDKIDLRLYNLRHNLSLDGQPLHLPLFAEPVNPR 700 

« 1 M 1 1 1 1 1 1 1 1 | llllllllllllllilllllliniMii inn, 

650 ENGH FL PP YNDELLAFWDKI DLRL YNLRHNLS LDGQ PLHL PLFTE PVNPR 699 

701 ELQVQHGPGDGLGGSAGSAQSRQSVYRFPLVIDKARNAANSVIQFGNALE 750 
, nn "Mil I INI II III 1:111 III) I MM II I I! 111:111 HIM || 

700 ELQVQHGAGDGLGGSAGSVQSRQSVYRFPLVIDKARNAASSVIQFGNALE 749 
751 NALTKQDSEAMTMLLQSQQQIVLQQTRDIQEKNLAALQASLEATMTAKAG 800 

„ cn II "I Ml III MM I II II II Mill MINIM I III II I MUM I 

750 NALTKQDSEAMTMLLQSQQQIVLQQTRDIQEKNLASLQASLEATMTAKAG 799 
801 AESRKTHFAGLADNWMSDNETASLALRTTAGIINTSSTVPIAITGGLDMA 850 

rtA I ' N I M 1 1 M I M II : I M M M M M I I I I I II M I M M I M II M 

800 AKSRKTHFAGLADNWMSHNETASLALRTTAGIINTSSTVPIAITGGLDMA 849 

851 PNI FGFAVGGSRWGAASAAVAQGLQIAAGVMEQTANI IDISES YRRRRED 900 
OCrt ■ 1 ' ' I I I 1 I I I I | | I | | I | I | I | | | | ) | | | | | | 1 | )J | , j ) , , , , , | f 

850 PNIFGFAVGGSRWGAASAAVAQGLQIAAGVMEQTANIIDISESYRRRRED 899 
901 WLLQRDVAENEAAQLDSQIAALREQMDMARKQLALAETEQAHAQAVYELQ 950 

I ■ I I I I I M 1 1 1 1 I 1 1 1 1 1 I 1 1 I | 1 1 1 1 1 1 f | | | | | | 1 1 || , | , | , | , , 

900 WLLQRDVAENEAAQLDSQIAALREQMDMARKQLALAETEQAHAQAVYELL 949 

951 STRFTNQALYNWMAGRLSSLYYQMYDAALPLCLMAKQALEKEIGSDKTVG 1000 
I I I M M M 1 1 I I I I I I I I I M I I M I I I II I I I I I I | M II I I : I I I I I 
STRFTNQALYNWMAGRLSSLYYQMYDAALPLCLMAKQALEKEIGNDKTVG 



950 
1001 
1000 
1051 



999 
1050 



VLSLPAWNDLYQGLLAGEALLLELQKLENLWLEEDKRGMEAVKTVSLDTL 

I ' I I 1 1 1 ! I I 1 1 M M H M II I 1 1 I I II II II I I II I 1 : 1 1 I M II 

IFTLPAraDLYQGLLAGEALLLELQKLENLWLEEDKRGMEAVRTVSLDTL 1049 



LRKTNPNSGFADLVKEALDENGKTPDPVSGVGVQLQNNIFSATLDLSVLG 1100 
IN Mill II : - 1 I I I 1 I I I ? I r | 1 I ) I 1 r I I M I i I ! II 
1050 LRKEKPESGFADFVK. . EVLDGKTPDPVSGVSVQLQNNIFSATLDLSTLG 1097 
• 

1101 LDRSYNQAEKSRRIKNMSVTLPALLGPYQDIEATLSLGGETVALSHGVDD 1150 
HI INN! Mill I I I I 1 1 1 1 1 1 I I I I I I I I ] | | 1 I 1 1 | I | I | | | 
LDRFYNQAEKAHRIKNLSVTLPALLGPYQDIAATLSLGGETVALSHGVDD 1147 

SGLFITDLNDSRFLPFEGMDPLSGTLVLSIFHAGQDGDQRLLLESLNDVI 1200 
1 1 1 1 1 I 1 1 1 1 M I M I 1 1 M M I 1 1 M Ml MM I MM III MINI I 

SGLFITDLNDSRFLPFEGMDPLSGTLVLSILHAGQDGDQRLLLESLNDVI 1197 



1098 
1151 
1148 



1201 FHIRYVMK* 1209 

IMIIMM 

1198 FHIRYVMK* 1206 
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80% 
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1. 2. 3. 4. 5. 

Lane 1. Digestion C: 3.5 Trypsin, 4 mg/mL 
Lane 2. Digestion B: 35 uL Trypsin, 4 mg/mL 
— , - -r-i-'.i - , L1 ^ Lane 3. Digestion A: no Trypsin 

~50kDa-> >— - Lane 4. Start material: 500 uLCRYl 529 protein, 1 13 u^mL 

Lane 5. Molecular weight markers 
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Purified 

Cryl529; 
no tag 

RARA 



Whole Cell 



MW 



Ciyl529-His 6 Cryl529-H^ Cry^-Hi^ Ciyl529-His 6 

RARA HANA QANA RARS 



MW 
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501 



tcaA2-1529 
tcaAl-1529 
tcaA-W14 



tcaA2-1529 
tcaAl-1529 
tcaA-W14 



550 



( 500 ) AGTCGCTGATCCTCAATAATGACAATATGAACC6AGAGGTAT |CTTCTCT6 
(485) AGTCGCTGATCCTCAATAATGACAATATGAACCGTGAGGTGT pTTCCCTG 

(485.) aggatctgatattaagcgaaacgacgatgaataaagaggtcaJcttccctt 



551 



, , 600 

( 5 5 0 ) GATATCCTTCTGGATGTGC TfeCAGCCCGAAGGCTCTGACACGCTGACATC 
(535) GATATCCTGCTGGATGTGCT( GCAGTCCGAAGGCTCCGGCACAnTnnraTr 

( 5 3 5 ) |GATATCTTGTTGGATGTGCT^ CAAA AAGGCGGTAAAGATATTACTGA 

SB105 



Fig. 11A 



1901 



tcaA2-1529 
tcaAl-1529 
tcaA-W14 



tcaA2-1529 
tcaAl-1529 
tcaA-W14 



1950 



( 1824 ) GATGCGCAGCACGGGTCTTTCCTTTGAGCA G|TTGGATTGGCTGATTGCCA 
(1788) GGCGCGCAGCACGGGACTTTGCTTTGAGCAG T TGGAT T GGCTGATTACCA 
(1809) ATCATCCCAGACCGGGCTATCATTTGAAGAA fTTGGACTGGCTGATTGCCA 

SB106 

2000 



1951 



(1874) fiTGCCAGCCGTb CCGTTATCGAACACGGTGGAGAGCTTTTTCTfifinTaAC 
(1838) ATACCAGCCGT GCCGTAATCGAACATGGTGGAGAACTGATTCTGGATAAG 
(1859) ^^^^^^AGTGTGCCGGACCACCACGACAAAATTGTGCTGGATAAG 



Fig. 11B 
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1651 1700 
tcaB2-1529 (1611) GGAGTGGGATGACACGTTGTCCCTGGCAACGACCGACCCGGACGAGATtE 

tcaB-W14 ( 1557 ) CGCATGGGATACCACACAGCCC GCCACCACTGATCCAGATGTGATC G 

tcaBl-1529 ( 1 608 ) GGAGTGGGATGACACGTTGTCCCTGGCAACGACCGACCCGGACGAGATT| 



1701 



1750 

tcaB2-1529 (1661) |CGATGGCCGACCCGATGCAATACAAG CT GGCf TATATT TAT TCACACC ATG 
tcaB-W14 (1604) CTATGGCGGACCCGATGCATTACAAGCTGG"C GATATTCCTGCATACCCTT 
tcaBl-1529 (1658) ICGATGGCCGACCCGAT GCAATACAAGCTGGq TAT CTTTAT TCACACCT T G 



SB101 



Fig. 12A 



tcaB2-1529 
tcaB-W14 
tcaBl-1529 



tcaB2-1529 
tcaB-W14 
tcaBl-1529 



2051 , 2100 

(1999) TGGGACAAAATCGATC TGCGT TTATACAAT T T G ^GCCACAATTTGAGT CT| 
(1951) TGGGATAAACTTGAGTTACGCCTATACAACCTG pGCCACAATCTGAGTCT^ 

(1996) t^g;ataaaatcgacctgcgtctctacaattta |cgccacaatctgagccti 

SB102 

2101 2150 

(2049) |ggacggtcagccgct| tcatttgccgctctttgccgaaccggtcaatccgc 
(2001) ggatggtcaaccgct aaatctgccactgtatgccacgccggtagacccga 

(2046) bgACGGTCAGCCGCTj rCATTTGCCGCTCTTTACCGAACCGGTCAATCCTC 



Fig. 12B 



2901 ^ [ 2950 

tcaB2-1529 (2845) rTnrAAflfiCAnriCGCTTTACGAATCAAGCTT lTGTATAACTGGATGGCfGGj 
tcaB-W14 (2797) CTGCAAACCACTCGTTTTACCGGGCAGGCAC ITGTATAACTGGATGGCCGG| 
tcaBl-1529 (2842) CTGCTAAGCACCCGTTTTACGAATCAAGCTT fTGTATAACTGGATGGCCGG| 

SB103 

2951 3000 
tcaB2- 152 9 (2895) |ACGTCTGTC| GTCTCTATACTATCAAATGTATGACGCCGCATTGCCGCTCT 
tcaB-W14 (2847 ) TCGTCTCTC CGCGCTCTATTACCAAATGTATGATTCCACTCTGCCAATCT 
tcaBl-1529 (28 92 ) 1aCGTCTGTC| GTCTCTATACTATCAAATGTATGACGCCGCATTGCCGCTCT 



Fig. 12C 
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2201 



2250 



tcaC-1529 (2125) AGCTCGTTTT6GTAC CGAAGCTCCACCCAGTATTGGCTGGATGAGAAA| CA 
tcdBl-W14 (2149) CACACCTTGCATTAC CGTAGCTCTGTCCAGTTCTGGCTGGATGAAAAA| GC 
tcdB2-W14 (2128) CACACCTTGCGTTAC CGCAGTTCCTCCCAATTCTGGCTGGATGAAAAa| gC 
xptCl-Xwi (2188) ACCACGCTGTATTAT ^GCAGCTCTGCCCAGTTCTGGCTGGATGAGAAAf TT 

SB215 



Fig. 13A 



tcaC-1529 
tcdBl-W14 
tcdB2-W14 
xptCl-Xwi 



(4263) 
(4380) 
(4362) 
(4428) 



4501 



A jTGGT T T ACAGTAAACGAAGATGAAAATGATA 
G|TGGTTTACCGTGAGTGAGGATGAGAATGATA< 



rGGTTTACTGTCAATGAAGATGAAAATGACA( 



rGGTTTGTGGTGAATGAAGATGAAAATGAC A( 
SB217 



4550 

atggactcatcattat 
;gccg-ctgatgcgctg 
gccg-ctgaggtgaag 
gccg-gtgaaatgaca 



Fig. 13B 
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1051 



1100 



tccC-1529 
tccCl-W14 
tccC2-W14 
tccC3-W14 
tccC4-W14 
tccC5-W14 
xptBl-Xwi 



(1027) TAT |CGTTATGAATATGATCCGGTAGGCAATfi TCCTTTCTATTTACAATGA 
(973) CTA CGCTATAAGTATGATCCGGTGGGGA^ GTTATCAGTATCCATAATGA 
(940) TTA SGCT AT CAATATGACCCGGT AGGC AA' ~ ~ ™ ~ ^ - -™ 



(961) CTG CGCTATGAATATGACCCGGTAGGCAA1 
(961) CTA CGCTATGAACATGATCCTGTAGGGAA 1 ! 
(955) CTA CGCTATCAATATGACCCAGTAGGCAAT 
(973) CTG CGTTATGAATATGATCCTGTCGGAAAT 
SB212 



TGATCAATATCCGTAATGA 
TCATCAGCATCCGTAATGA 
.TTATTAGTGTCCGTAATGA 
TCATTAGTATCCGTAATGA 
TGCTGAAATCAACTAATGA 



Fig. 14A 



1951 



2000 



tccC-1529 (1903) TACAAGACCATC jCGCTATTCAGGCAAAGAGCGGGATG CCACAGGCCTGTA 

tccCl-Wl4 (1861) TACAAATTTATT bGTTACTCCGGTAAAGAGCGGGATGjC CACTGGATTGTA 

tccC2-W14 (1807) TATAAAACGATT CGCTATT'CCGGCAAAGAACGAGATG CCACCGGGTTGTA 

tccC3-W14 (1849) TATAAAACTATC CGTT ATTCAGGCAAAGAGCGGGATG CCACCGGGCT ATA 

tccC4-W14 (1849) TACAAAATCCTC CGTTACTCAGGTAAAGAACGCGATG' CTACCGGGCTCTA 

tccC5-W14 (1822) TATAAAACCATT CGTTATTCTGGTAAAGAGCGGGATG TTACCGGGCTGTA 

xptBl-Xwi (1855) TACAAAACCGTG CGTTATTCTGGCAAAGAGCGGGATq CAACAGGGTTGTA 

SB213 



Fig. 14B 
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SEQUENCE LISTING 
<110> Dow AgroSciences LLC 

<120> Pesticidally Active Proteins and Polynucleotides Obtainable from 
Paenibacillus Species 

<130> DAS-101XC2 

<150> US 60/392,633 
<151> 2002-06-28 

<150> US 60/441,647 
<151> 2003-01-21 

<160> 49 

<170> Patent In version 3.2 

<210> 1 

<211> 33521 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Nucleic acid sequence of the entire insert of SB12. 

<400> 1 



gatcacacgg ccggcgtatt 


ccggctcgga accgaagaat taacagaagc 


gcttcagcag 


60 


tccggttatc ggacagtctt 


tgatattgca tctgaaaatc ttgcggaatt 


tcagaaaagc 


120 


aatccggaga ttccctcttc 


cgacgcgaag gagattcatc aattagccgt 


ccagaggaca 


180 


gaaaacttat gcatgcttta 


taaggcctgg caactgcaca atgatccggt 


cgtccagagc 


240 


cttcccaaat tatccgcgga 


taccggcctg cgaggcatgc gtgccgcgtt 


ggagcggagt 


300 


cttggagggg gagccgattt 


tggagacttg ttcccggagc gatcgccaga 


gggctatgcg 


360 


gaagcctcct ctatacagtc 


gcttttttcg ccgggacgtt accttacggt 


gctgtataaa 


420 


attgcgcagg atctccacga 


cccaaaagac aaactgcata ttgacaaccg 


ccgtccagat 


480 


ttgaagtcgc tgatcctcaa 


taatgacaat atgaaccgtg aggtgtcttc 


cctggatatc 


540 


ctgctggatg tgctgcagtc 


cgaaggctcc ggcacactga catccctgaa 


ggatacctac 


600 


tatccgatga cccttcccta 


tgatgacgac cttgcgcaaa tcaatgccgt 


ggcggaggcg 


660 


cgctcatcca atttgctggg 


gatctgggat accctgctgg acacgcagcg 


gacttccatc 


720 


ctgcaggatt ccgccgctgt 


ccaccggata agcaagccgc ggcactcggc 


atacgtcaat 


780 


cagagagtct ccgatgatga 


accggtattg atcgcgggag aggaattcta 


cttggagacc 


840 


ggcggtgttg ccgacacgac 


cccgtctccg ccaacgaggg aagcgctttc 


cttgacgcca 


900 


aacagcttcc gtctgctggt 


caaccccgag ccgacagcag acgacatcgc 


caatcactac 


960 


aacgttaaga ctcaagatcc 


tgccgctctg gccgccgtct taaatgtggt 


cgatgacttt 


1020 
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tgcctgaaaa ccggtttgag ctttaatgag ttgctggact taacgatgca gaaggatgat 1080 

gaatcgatcg gcagcgagta caaaagccgg tttgtaaaat ttggcggcga ggccaatgtt 1140 

ccggtttcaa cctatggagc tgtatttctg acaggaacgg aagaaactcc gttgtgggta 1200 

ggaaaaggag ctgtgataag ccctgcagcg gacgcctatg ttcgtaatgg gacatatgca 1260 

aacacgaatt atggatcaga cactagtctt gttgtgaagc aggatgggtc tagtggatac 1320 

agtagggaag catatatcag gtttgatttg acaggtcttt ccggagttgt ggaggaagct 1380 

aaaatttctc taacaactag agcgaaacaa ttgtctagct taagacacca agctcatttg 1440 

gtcagtgaca acagttggga tgaattgaaa atcacatgga ataacaaacc tgcaggagga 1500 

gcgatcatcg caagctggga tgttcccgaa gttggtgaga atgtaaaggt tgatgtgacc 1560 

cggcaagtaa atgatgcgct cgcaaacggt caagataaac tatcaattgt tattcgttct 1620 

agtgcaaatt atggcagtct gggcgatgtc tcttatgcct ctagagaaca ccctgaaaaa 1680 

gcctcacgac cttctatgga aatcaaggcg ataacgggtg ctggtttaaa ttttacggcg 1740 

gataatgttg tagctctggc aggaagggcg gaaaagcttg tccggctggc gcgcagcacg 1800 

ggactttcct ttgagcagtt ggattggctg attaccaata ccagccgtgc cgtaatcgaa 1860 

catggtggag aactgattct ggataagccg gtactggagt ctgtggccga attcacaagg 1920 

ctccataagc gttatggcat cacagcggat atgttcgccg cgtttatcgg cgaagtcaat 1980 

acgtatgctg aagcaggtaa agagagcttt tatcagacga ttttcagcac ggccgaccat 2040 

tcggctgcct tacctttagg cgcaactttg caatttgagg tgagcaaaca ggatcgatat 2100 

gaagcgattt gctgcggggc catgggggtg accgccgatg agttctctcg tatcggcaaa 2160 

tactgctttg gcgacaacgc gcagcaagtt accgccaatg aaacaaccgt tgcgcagctt 2220 

tatcgtttag gccgaattcc tcacatgctt ggattgcgtt ttaccgaggc ggagctgttg 2280 

tggaaattga tggctggcgg cgaggatacc ttgctccgca cgattggcgc gaagcctcgc 234 0 

agtttacaag ccttagagat tattcgccgt actgaggtcc ttttggactg gatggatgct 2400 
catcagcttg atgttgtctc cctgcaagcc atggttacca atcggtacag cggcacagcc 2460 
acgccggagc tgtacaactt tttggcacag gtgcaccaat ccacaagcag tgccgcgaac 2520 
gtgtccaaag cggatgctca ggataccctg cccgcggaca agctgttccg ggccttggcg 2580 
gtaggcttca acctgaaggc caacgtgatg gcgcaggtca ttgactggtt ggacaaaacc 2640 
gacggagcgt ttacgctgcg ggctttctgg gacaagcttc aagcgtattt cagcgccgat 2700 
catgaagaag aactgacggc cttggaagga gaagccgact tgctgcagtg gtgccaacag 2760 
atcagccagt atgcgctcat tgtccgctgg tgcgggttaa gcgatcagga tctggcgctg 2820 
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ctgaocgggc atcccgggca gcttctgtcc ggacaacata cggtgccggt accctcgctg 2880 
catctcctgc tggtgctgac ccgcctgaag gaatggcagc agcgcgtcca ggtttccagc 2940 
gaggaggcca tgcgctattt tgcccaggcc gatgcgccaa ccgtcacacg cgatgctgcg 3000 
gtcaagctgc ttgcccgtat ccatggctgg aatgaacagg ataccgcctc gatgaatgac 3060 
tacctgctgg gagagaacga atatcctaag aactttgagc agatctttac tttggaaagc 3120 
tgggtcaacc tgggccgtca actgaatgtt ggcagccgaa cgttgggaga gctggttgac 3180 
atgtcagaag aggatgatac cgcggaaaac acggatttga ttatctcggt cgcccaaagc 3240 
ctgatggctg cggtgcaggc ctgaaccaac atgaccaagg aaggtggtaa gaatatgtct 3300 
acttcaaccc tgttgcaatt gattaaggaa tcccgccggg atgcgttggt caaccattat 3360 
atcgccaaca atgtcccgag agagcttacg gataagatta cagacgcaga cagcctgtat 3420 
gagtatttgc tgctggatac caagatcagt gaactogtaa aaacatcgcc gatagctgag 3480 
gccattagca gcgttcagtt atacatgaac cgatgcgtgg aaggctatga aggcaagctg 3540 
actccggaag gcaaoagoca tttogggccg ggaaaattcc tgaataattg ggatacctat 3600 
aacaagcgtt attccacttg ggccggcaag gaacgtctga aatattatgc aggcagttat 3660 
attgacccgt ccttgcgcta taacaaaacg gatccgttcc tgaacctgga acagaatatc 3720 
agccagggaa gaatcaccga tgacaccgta aagaacgcgc tgcaacacta cctgactgaa 3780 
tatgaagtgt tggcggattt ggaatatatc agcgtaaata aaggcgccga tgaaagtgta 3840 
ttattcttcg taggccgcac caaaacaatg ccatacgaat attactggcg ccgattaacg 3900 
ttgaaaaagg acaataacaa taaactggtg cctgccatct ggtctcaatg gaaaaaaata 3960 
actgccaata tcggcgaagc agttaataat tatgtggtgc ttcactggca taataaccgc 4020 
ttacatgtac aatggggttc tacagagaaa acacaaaatg atgacggaga acccattgag 4080 
aaacgatatt tgaatgactg gttcatggat aagtccagtg tctggtcttc attccgaaag 4140 
gtttcatata tagaaaatag ttttacttat actgagggca tcattgattc aagaaatatt 4200 
actatagctg gaaatcaact gttctgtgat gattcaaata cttttaaggc aacaataacg 4260 
gcacttccat ttgaccaaat acgtgtttac ttagaaaaga tttacggtac aggcggcagc 4320 
atcacggtta ctggagaaaa taaaggctat attattaagg tgggggagcc aagagaagtc 4380 
agtttctctc ctaatacgtt actagatgta ttcataggta gtaatgcaag ccctcgagao 4440 
ccatatttca aagctacatt taatagagaa gctctccaaa attcatacgg ctcaattaaa 4500 
ataaatcaat acacccctcc ttctggaagc aatatcaaag gtcctatcga ccttaccctg 4 56 0 
aaaaataaca tcgacotgtc ggcgttgttg gaagagagcc ttgacgtact gttcgactat 4620 
accattcagg ggaataacca attgggcggc ttagaggcct ttaacgggcc ttacggactt 4680 
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tatttgtggg 


aaatcttcct 


ccatgttcca 


tttttaatgg 


cggttcgctt 


ccacaccgag 


4740 


cactgagaga 


tcccctcata 


atttccccaa 


agcgtaacca 


tgtgtgaata 


aattttgagc 


4800 


tagtagggtt 


gcagccacga 


gtaagtcttc 


ccttgttatt 


gtgtagccag 


aatgccgcaa 


4860 


aacttccatg 


cctaagcgaa 


ctgttgagag 


tacgtttcga 


tttctgactg 


tgttagcctg 


4920 


gaagtgcttg 


tcccaacctt 


gtttctgagc 


atgaacgccc 


gcaagccaac 


atgttagttg 


4980 


aagcatcagg 


gcgattagca 


gcatgatatc 


aaaacgctct 


gagctgctcg 


ttcggctatg 


5040 


gcgtaggcct 


agtccgtagg 


caggactttt 


caagtctcgg 


aaggtttctt 


caatctgcat 


5100 


tcgcttcgaa 


tagatattaa 


caagttgttt 


gggtgttcga 


atttcaacag 


gtaagttagt 


5160 


tgctagaatc 


catggctcct 


ttgccgacgc 


tgagtagatt 


ttaggtgacg 


ggtggtgaca 


5220 


atgagtccgt 


gtcgagcgct 


gattttttcg 


gcctttagag 


cgagatttat 


acaatagaat 


5280 


ttggcatgag 


attggattgc 


ttttagtcag 


cctcttatag 


cctaaagtct 


ttgagtgact 


534 0 


agatgacata 


tcatgtaagt 


tgctgatagg 


tttccagttt 


tccgctcctia 


ggtctgcata 


5400 


ttgtactttt 


cctcttactc 


gacttaacca 


gtaccaaccc 


agcttctcaa 


cggatttata 


5460 


ccatggcact 


ttaaagccag 


catcactgac 


aatgagcggt 


gtggtgttac 


tcggtagaat 


5520 


gctcgcaagg 


tcggctagaa 


attggtcatg 


agctttcttt 


gaacattgct 


ctgaaagcgg 


5580 


gaacgctttc 


tcataaagag 


taacagaacg 


accgtgtagt 


gcgactgaag 


ctcgcaatac 


5640 


cataagccgt 


ttttgctcac 


ggatatcaga 


ccagtcaaca 


agtacaatgg 


gcatcgtatt 


5700 


gcccgaacag 


ataaagctag 


catgccaacg 


gtatacagcg 


agtcgctctt 


tgtggaggtg 


5760 


acgattacct 


aacaatcggt 


cgattcgttt 


gatgttatgt 


tttgttctcg 


ctttggttgg 


582 0 


caggttacgg 


ccaagttcgg 


taagagtgag 


agttttacag 


tcaagtaagg 


cgtggcaagc 


5880 


caacgttaag 


ctgttgagtc 


gttttaagtg 


taattcgggg 


cagaattggt 


aaagagagtc 


5940 


gtgtaaaata 


tcgagttcgc 


acattttgtt 


gtctgattat 


tgatttttgg 


cgaaaccatt 


6000 


tgatcatatg 


acaagatgtg 


tatctacctt 


aacttaatga 


ttttgataaa 


aatcattagg 


6060 


ggattcatca 


gcaccgagca 


gcggtatgag 


ttggcggaac 


gatggtttaa 


attcattttc 


6120 


aacagcgcag 


gttaccgtga 


tggctacggc 


aatctgctga 


cggatgacaa 


aggcaacgtg 


6180 


cgctactgga 


acgtcgtgcc 


tctgcaggag 


gatacggagt 


gggatgacac 


gttgtccctg 


6240 


gcaacgaccg 


acccggacga 


gattgcgatg 


gccgacccga 


tgcaatacaa 


gctggctatc 


6300 


tttattcaca 


ccttggactt 


cttgatcagc 


cgcggcgaca 


gcttgtaccg 


gatgctggag 


6360 


cgggatacct 


tgaccgaagc 


gaagatgtat 


tacattcagg 


ccagccaact 


gcttgggcct 


6420 


cgtcccgaga 


tccggatcaa 


tcacagctgg 


cctgatccga 


ccctgcaaag 


cgaagcggac 


6480 
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gcggtaaccg ccgtgccgac 
gcgcttctga atgcggaaaa 
ttctgggata aaatcgacct 
cagccgcttc atttgccgct 
catggggcag gcgatggatt 
tatcgttttc ctctggtcat 
gggaatgccc tggaaaacgc 
caatcccagc agcagattgt 
tcgctgcaag caagtctgga 
acccattttg ccggcctggc 
ctgcgtacca ctgcgggaat 
ggcttggata tggctccgaa 
gccagcgcgg ctgtagccca 
aatatcatcg atatcagcga 
gatgttgccg agaatgaagc 
atggatatgg cgcgaaaaca 
gtctacgagc tgctaagcac 
cgtctgtcgt ctctatacta 
aaacaggctt tagagaaaga 
gcctggaatg atttgtatca 
aagctggaga atctgtggct 
tctttagata cccttctccg 
gaagttctgg acggaaagac 
aatattttca gtgcaaccct 
gcggaaaagg cccacaggat 
tatcaggata ttgcggcaac 
gtggatgaca gcggcttgtt 
ggtatggatc ctttatccgg 
gaccagcgcc tcctgctgga 
aaatagaaga caaactcccg 
aatcattgac ataaatatac 



gcgaagcgat tcgccggcag cgccaattct cgccttgcga 6540 
cgggcatttc ctgccgcctt ataatgatga actattagct 6600 
gcgtctctac aatttacgcc acaatctgag cctggacggt 6660 
ctttaccgaa ccggtcaatc ctcgtgagct gcaggttcag 6720 
agggggaagc gccggttccg tccaaagccg tcaaagtgtc 6780 
cgataaggcg cgcaatgccg cgagtagtgt tatccaattc 6840 
gctgacaaag caggacagcg aggccatgac tatgctgttg 6900 
cctgcagcaa acccgcgata ttcaggagaa gaacctggcc x 6960 
agcaacgatg acagccaaag cgggcgcgaa atcccgaaag 7020 
ggataactgg atgtcgcata atgaaaccgc ctcacttgca 7080 
tatcaataca agctcgaccg tgccaatcgc tatcactggc 7140 
catttttggt ttcgcagttg gaggttcccg ctggggagca 7200 
aggattgcaa atcgccgccg gcgtaatgga acagacggcc 7260 
aagctaccgc cggcgccggg aggattggct gctgcagcgg 7320 
ggcgcagttg gattcgcaga ttgcggccct gcgggaacag 7380 

acttgcgctg gcggagacgg aacaggcaca cgcgcaagcg 7440 

ccgttttacg aatcaagctt tgtataactg gatggccgga 7500 

tcaaatgtat gacgccgcat tgccgctctg cttgatggcc 7560 

aatcggcaat gataaaacgg ttggaatctt caccctcccg 7620 

gggattgcta gcgggcgagg cgctgctgct cgagcttcag 7680 

ggaggaggac aagcgcggaa tggaagctgt aagaacggta 7740 

caaagaaaag ccagaatccg gttttgcaga tttcgtcaag 7800 

gcctgaccct gtaagcggag ttagcgtaca gctgcaaaac 7860 

tgacctgtcc acccttggcc tggatcgctt ttacaaccaa 7920 

caaaaacctg tcggttacct tacccgcgct attgggacct 7980 

cttatcgcta ggtggcgaga ccgttgcgct ttcccatggc 8040 

tatcacggat ctcaacgaca gccgtttcct gcctttcgag 8100 

cacactcgtt ctgtcgatac tccatgccgg gcaagacggt 8160 

aagcctgaac gacgtcatct tccacattcg atatgtcatg 8220 

cgaaatagtt caaccgcggg agttctttat tttccaccca 8280 

tttaataata tgttggagga agaagaggag gttgttattg 8340 
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gtgagaataa agaaaatgtt tgaagtagcg atgatgttat cattggcgtg tttgtttttt 8400 

gttacatcag ctgcttcagc aaaaacaact aatttaactt cttccccgaa acttatgaac 8460 

tgctttgatg tagctggtaa cgtaacctac aaaaccgctc ctgatggttc aattacgaaa 852 0 

ataatcgaag tacaagatgt tagtaaattc agtgaacaaa ccaatctcag gttggctcca 8580 

gattctaaag ttaccattta tattcctgac tccagtgaca ataaccagcc caattactcc 8640 

aataataatt ccaatgacta tagtgaacaa aattacctca atactaaccc caacgttgaa 8700 

ccatttgttg aaccatttgt tgaacttata aagtatattg cgaacgtaag tgatccatat 8760 

gaagcgtgtg gatcaaaatc gattagagat tctgattatg atcctcccgg cggaaaaatg 882 0 

ataataaaac aagggataca ggctacacac tcaaccacgg tttctatcga tgccaaaatc 8880 

gtttcaactg ccttaaaata tgatgtaaca acgagttatt ccattgaaga ggagcaaaat 894 0 

attaaagtac cagacaataa aagaggaaga attattgctt atccaaagta tgatgtcaac 9000 

acctttgaaa tatgggaagc tggtctaata tataataaaa agattggaga cggtacggct 9060 

ttctatccta aaggagtatg ttttgttaca attattaatt aactttcaaa taagagaagc 9120 

tgtttcttaa gaataaagaa gcagcttctt gcatttttta ttatgatatt acactctatc 9180 

ttctgtcaga tgctccctct caacttccat tcccaatccc ctcttcttaa aaggccataa 9240 

aagttacact tatcatttcc gtcattgcta atctaccttg cagttaacct aaaatatacc 9300 

ttccgggatt cctgaaggat gaaacatatt ttcacccatc agtgaaacta tctatgcttt 9360 

tttgattgaa gcgagaggta tgcttgggtt gtaaatgaaa ggggggacct gttcatggga 9420 

aactacctaa aacggtaacg gatagcttct atggttttga tgtctgcagt cattttgtct 94 80 

tggggagtac tea teat tea gaacacaaga ggaggagttc atggtgtcaa caacagacaa 9540 

cacggccggc gtattcegge teggaacega agaattaaca gaagegctta ageagtcegg 9600 

ttateggace gtctttgata ttgtatctga caatcttgeg gaatttcaga aaaacaatcc 9660 

ggagattccc tcttctgacg cgaaggagat tcatcaatta gccgtccaga ggacagaaaa 9720 

ettatgeatg ctttataagg cctggcagct gcacaatgat ccggttgtcc agagccttcc 9780 

caaattatcc gcggataccg gectgeaagg catgcgtgcc gcgttggagc ggagtcttgg 9840 

aggeggagee gattttggag acttgttccc ggagegateg ccagagggct atgeggaage 9900 

ctcctctata cagtegcttt tctcgccggg acgttacctg acggtgctgt ataaaattgc 9960 

gegggatetc cacgacccaa aagataaact gcatattgac aaccgccgtc cagatttgaa 10020 

gtegctgate ctcaataatg acaatatgaa ccgagaggta tcttctctgg atatccttct 10080 

ggatgtgctg cagcccgaag gctctgacac gctgacatcc ttgaaggata cctaccatcc 10140 
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gatgaccctt ccctatgatg acgaccttgc gcaaatcaat gccgtggcgg aggcgcgttc 10200 
atctaatttg ctggggattt gggataccct gctggacacg cagcggactt ccatcctgca 10260 
gaattccgcc gctgcccgcc ggataagcaa ggcgcggcac tcggcatacg ccaatcagaa 10320 
agcctccaat gatgagccgg tattcatcac gggagaggaa atctacctgg aaaccggagg 10380 
taaacggctt tttctggcgc ataaactcga gataggttca actattagcg ctaaaatcaa 10440 
cattggaccg ccgcaagcgg ccgatatcgc gccggcaaag ttgcaactcg tatattacgg 10500 
cagaggcggc agagggaact acttcctgcg cgtggcagac gatgtgtccc tcggtggaaa 10560 
gctgctgacc aattgttatc tgaccagcga tgacggacag agcaacaata ttagcgggcc 1062 0 
atactgccta atgatcaacc gaggcaccgg cagcatgcct agcgggactc accttccagt 10680 
tcagattgaa agagtgaccg atacatccat ccgcattttt gtgccggatc acggctattt 10740 
ggggctaggc gaaagccttg ccagcaactg gaatgaaccg ttggcgctga atctgggctt 10800 
ggatgaagcg ttgaccttta ccttgagaaa gaaggagacg ggaaatgaca ccatttccat 10860 
aatcgacatg ctgccgccgg tagcgaacac gactccgtct ccgccgacga gggaaacgct 10920 
ttccttgacg ccaaacagct tccgtctgct ggtcaaccct gagccgacag cggaggacat 10980 
cgccaagcac tacaacgtca cgacggtaac ccgggctcct gccgatctgg cctccgcctt 1104 0 
aaatgttgtc gatgatttct gcttgaaaac cggtttgagc tttaacgaat tgctggattt 11100 
aaccatgcag aaggattatc agtcaaaaag cagtgagtac aaaagccgat ttgtaaaatt 11160 
cggcggcggg gagaatgttc cggtatcaag ctatggcgca gcctttctga caggagcgga 1122 0 
agatactcct ttgtgggtga aacagtataa cagcgtgggg actgcaacaa gcacccctgt 11280 
tttaaacttt acgccagata atgttgtggc tttggcagga agggcggaaa agcttgtccg 1134 0 
gctgatgcgc agcacgggtc tttcctttga gcagttggat tggctgattg ccaatgccag 11400 
ccgtgccgtt atcgaacacg gtggagagct ttttctggat aagccggtac tggaagctgt 11460 
ggccgaattc acaaggctca ataagcgtta tggcgtcaca tcggatatgt tcgccgcgtt 11520 
tatcggcgaa gtcaatacgt atacagaagc gggcaaggac agcttttatc aggcgagttt 11580 
cagcacggcc gaccattcgg ctaccttacc tttgggcgct tctttgcaac ttgaggtgag 1164 0 
caagcaggat cgatatgaag cgatttgctg cggggctatg ggggtgaccg ccgatgagtt 11700 
ctcccgtatc ggcaaatact gctttgggga taaagcacag caaatcacgg ccaatgaaac 11760 
aaccgttgcc cagctttatc gtttaggccg aattcctcat atgctaggct tgcgttttac 11820 
cgaggcagag ctgttgtgga aattgatggc tgggggcgag gataccttgc tccgcacgat 11880 
tggcgcgaac cctcgcagtt tagaagcgtt agagattatt cgccggacgg aggtcctttt 1194 0 
ggactggatg gatgcccatc agctggatgt tgtctccctg caagccatgg ttaccaatcg 12000 
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gtacagcggc acagccacgc cggagctgta caattttttg gcacaggtgc atcaatccgc 12060 
aagcagtgcc gcgaacgtgg ccagagcgga tggtcaggat acgttgcctg cggacaagct 12120 
gctccgggca ttggcggcgg gcttcaaact gaaagccaac gtgatggcgc gagtaatcga 12180 
ctggatggac aaaaccaata aagcgtttac gctgcgggct ttctgggaca agcttcaagc 1224 0 
gtatttcagc gccgatcatg aagaagaact gaccgccctg gaaggagaag ccgcaatgct 12300 

gcagtggtgc cagcagatca gccagtatgc gctcattgtc cgctggtgcg ggttaagcga 12360 

gcaggatctg gcgctgctga ccgggaatcc ggagcagctt ctggacggac aacatacggt 12420 

gcccgtaccc tcgctgcatc tcctgctggt gctgacccgc ctgaaggaat ggcagcagcg 12480 

cgtccaggtt tccagcgagg aggctatgcg ctattttgcc caggccgatt cgccaaccgt 12540 

cacgcgcgac gatgcggtta atctgcttgc ccgtatccat ggctggaatg aagcggatac 12600 

cgtctcgatg aatgactacc tgctgggaga gaacgaatat cctaagaact ttgatcagat 12660 

ctttgcactg gaaagctggg tcaacctggg ccgtcaactg aacgtgggca gcagaacgct 1272 0 

gggagagctg gttgacatgg ctgaagagga taaaaccgcg gaaaacatgg atctgattac 12780 

ttcggtggcc catagcctga tggctgcagc gaaagcctga accaacatga ccaaggaagg 12840 

tgataagcat atgtctactt caaccctgtt gcaatcgatt aaagaagccc gccgggat^c 12900 

gctggtcaac cattatattg ctaatcaggt tccgacagcg cttgcggaca agattacgga 12960 

cgcggacagc ctgtatgagt acttgctgct ggataccaag atcagtgaac tcgtaaaaac 13020 

atcgccgata gcggaggcca tcagcagcgt gcagttatac atgaaccgct gcgtcgaagg 13080 

ctatgaaggc aagttgactc cggaaagtaa tactcatttt ggcccaggta aatttctata 1314 0 

taactgggat acgtacaaca aacgtttttc cacctgggca ggaaaagaac gcttgaaata 13200 

ttatgcaggc agctatattg agccgtcctt gcgctacaac aaaaccgatc cattcctgaa 13260 

cctggaacag agcatcagcc agggaagaat tactgatgat accgtaaaga acgcgctgca 13320 

acactacctg actgaatatg aagtgttggc ggatctggat tatatcagcg ttaataaagg 13380 

cggcgacgaa agtgttttac tctttgttgg acgcaccaaa accgtaccgt atgaatacta 13440 

ctggcgccgt ttgcttttaa aaagggacaa taataataag ctagtaccag cagtctggtc 13500 

tcagtggaaa aaaatcagtg ccaatatcgg tgaagcggtt gatagttatg tggtgcctcg 13560 

gtggcataaa aaccggctac atgtgcaatg gtgttctata gagaaaagtg aaaatgatgc 13620 

cggtgaaccc attgagaaac gatatttgaa tgactggttc atggatagtt ccggagtctg 1368 0 

gtcttcattt cgaaagattc cggttgtgga aaagagtttc gaatatttgg acggaagcct 13740 

cgatccccga tttgtcgctc ttgttagaaa tcaaatatta attgatgagc cagaaatatt 13800 



WO 2004/002223 



PCT/US2003/020082 



9 

cagaattaca gtatcagccc ctaatccgat agatgcaaat ggaagagtag aggtacattt 13860 
tgaagaaaac tatgcaaaca gatataatat taccattaaa tatgggacaa cgagtcttgc 13920 
tattcctgca gggcaggtag ggcatccaaa tatctctatt aatgaaacat taagggttga 13980 
attcggcacc aggccggatt ggtattatac tttcagatat ttaggaaata caatccaaaa 14040 
ctcatacggt tcaattgtca ataatcaatt ttcacctcca tcaggaagca atattaaagg 14100 
tcctatcgac cttaccctga aaaataacat cgacctgtcg gccttgttgg atgagagcct 14160 
tgacgcactg ttcgactata ccattcaggg cgataaccaa ttgggcggct tagctgcctt 14220 
taacgggcct tacggacttt acttgtggga aatcttcttc catgttcctt ttttaatggc 1428 0 
ggfctcgcttc cacaccgagc agcggtatga gttggcggaa cgttggttta aattcatctt 14340 
caacagcgca ggataccgtg atgattacgg cagtctgctg acggatgaca aaggcaacgt 14400 
gcgttactgg aacgtgatac cgctgcaaga ggacacggag tgggatgaca cgttgtccct 14460 
ggcaacgacc gacccggacg agattgcgat ggccgacccg atgcaataca agctggctat 14520 
atttattcac accatggact tcctgatcag ccgcggcgat agcttgtacc ggatgctgga 14580 
gcgggatacc ctggccgaag ccaagatgta ttacattcag gccagccaac tgcttgggcc 14640 
ccgccccgac atccggctca atcacagttg gcctaatccg accttgcaaa gcgaagcgga 14700 
cgcggtaacc gccgtgccga cgcgaagcga ttcgccggca gcgccaattt tggccttgcg 14760 
agcgcttctg acaggcgaaa acggtcattt cctgccgcct tataatgatg aactgttcgc 14820 
tttctgggac aaaatcgatc tgcgtttata caatttgcgc cacaatttga gtctggacgg 14880 
tcagccgctt catttgccgc tctttgccga accggtcaat ccgcgtgaat tgcaggttca 14940 
gcatggcccg ggcgatggct tggggggaag cgcgggttcc gcccaaagcc gtcagagtgt 15000 
ctatcgtttt cctctggtca tcgataaggc gcgcaatgcg gccaacagtg tcatccaatt 15060 
cggcaatgcc ctggaaaacg cactgaccaa gcaagacagc gaagcaatga ccatgctgtt 15120 
gcagtcccag cagcagattg tcctgcagca aacccgcgat attcaggaga agaacctggc 1518 0 
cgcgctgcaa gcaagtctgg aagcaacgat gacagcgaaa gcgggggcgg agtcccggaa 1524 0 
gacccatttt gccggcttgg cggacaactg gatgtcggac aatgaaaccg cctcactcgc 15300 
actgcgtacc accgcgggaa tcatcaatac cagctcaacc gtgccgatcg ccatcaccgg 15360 
cggcttggat atggctccga acatttttgg tttcgcagtt ggaggttccc gctggggagc 15420 
agccagcgcg gctgtagccc aaggattgca aatcgccgcc ggcgtaatgg aacagacggc 15480 
caatattatc gatattagcg aaagctaccg ccggcgccgg gaggattggc tgctgcagcg 1554 0 
ggatgttgcc gaaaatgaag cggcgcagtt ggattcgcag attgcggccc tgcgggaaca 15600 
gatggatatg gcgcgcaagc aacttgcgct ggcggagacg gaacaggcgc acgcgcaagc 15660 
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ggtctacgag ctgcaaagca cccgctttac gaatcaagct ttgtataact ggatggctgg 15720 

acgtctgtcg tctctatact atcaaatgta tgacgccgca ttgccgctct gcttgatggc 15780 

gaagcaggct ttagagaaag aaatcggttc ggataaaacg gtcggagtct tgtccctccc 15840 

ggcctggaat gatctatatc agggattatt ggcgggcgag gcgctgctgc tcgagcttca 15900 

gaagctggag aatctgtggc tggaggaaga caagcgcgga atggaagccg taaaaacagt 15960 

ctctctggat actcttctcc gcaaaacaaa tccgaactcc gggtttgcgg atctcgtcaa 16020 

ggaggcactg gacgaaaacg gaaagacgcc tgacccggtg agcggagtcg gcgtacagct 16080 

gcaaaacaat attttcagcg caacccttga cctctccgtt cttggcctgg atcgctctta 16140 

caatcaggcg gaaaagtccc gcaggatcaa aaatatgtcg gttaccttac ctgcgctatt 1620 0 

ggggccttac caggatatag aggcaacctt atcgctaggc ggcgagaccg ttgcgctgtc 16260 

ccatggcgtg gatgacagcg gcttgttcat cactgatctc aacgacagcc ggttcctgcc 16320 

tttcgagggc atggatccgt tatccggcac actcgtcctg tcgatattcc atgccgggca 16380 

agacggcgac cagcgcctcc tgctggaaag tctcaatgac gtcatcttcc acattcgata 16440 

tgttatgaaa tagctttaca gtcagatata ttccggggct tgtattcaca agcccctcca 16500 

aggaggaatt gggttatgcc acaatctagc aatgccgata tcaagctatt gtcgccatcg 16560 

ctgccaaagg gcggcggttc catgaaggga atcgaagaaa acatcgcggc tcccggctcc 1662 0 

gacggcatgg cacgttgtaa tgtgccgctg ccggtaacct ccggccgcta tattactcct 16680 

gatataagcc tgtcctatgc gagcggccac ggcaacggcg cttatggaat gggctggacg 1674 0 

atgggagtga tgagcattag ccggagaaca agccgaggga cccccagtta tacatccgaa 16800 

gaccagttcc ttggtccgga tggggaggtg cttgttccgg aaagcaacga acaaggggag 16860 

atcattaccc gccacaccga tacggcccaa gggataccgt taggcgagac gtttacggtt 16920 

acacgctatt ttccccggat cgagagcgct tttcatttgc tggaatactg ggaagcgcaa 16980 

gcaggaagcg caacagcgtc gttttggctt attcactctg ccgatggagt gctgcactgt 17040 

ctgggtaaaa ctgctcaggc gaggatagcc gcccctgacg attccgccaa gatcgcagaa 17100 

tggctagtgg aggagtccgt ctcccccttc ggagagcata tttattacca atacaaagaa 17160 

gaagacaatc aaggcgtgaa tctggaggaa gacaatcatc aatatggggc gaaccgctat 17220 
ctgaaatcga ttcgctatgg aaataaggtt gcctctcctt ctctctatgt ctggaagggg 17280 
gaaattccgg cagacggcca atggctgtat tccgttatcc tggattatgg cgagaacgat 17340 
acctcagcgg atgttcctcc cctatacacg ccccaagggg agtggctggt gcgcccggac 17400 
cgtttttccc gctatgacta cggatttgag gtccggactt gccgcttgtg ccgccaggtc 17460 
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ttgatgttcc acgtctttaa ggagcttggc ggggagccgg cgctggtgtg gcggatgcag 17520 

ttggaatacg acgagaaccc ggcggcgtcc atgctgagcg cggtccggca attggcttat 17580 

gaagcagatg gggccattcg aagcttgccg ccgctggaat tcgattatac tccatttggc 17640 

atcgagacaa cggccgattg gcagcctttt ctgcctgtgc ctgaatgggc ggatgaagaa 17700 

cattatcagt tggtcgattt gtacggagaa ggcataccgg gcttattata tcagaacaat 17760 

gaccactggc attatcgttc gcccgcccgg ggcgacacac cggacgggat cgcctataac 17820 

agctggcggc cgcttcctca tatccccgtg aactcccgga acgggatgct gatggatctg 17B80 

aatggagacg ggtatctgga atggttgctt gcggaacccg gggttgcggg gcgctatagc 1794 0 

atgaacccgg ataagagctg gtccggtttt gtgccgctcc aggcactgcc aacggaattc 18000 

ttccatccgc aggcacagct tgccaatgtt accggatcgg gtttaaccga cttggttatg 18060 

atcggtccga agagcgtccg gttttatgcc ggagaagaag cgggcttcaa gcgcgcatgt 18120 

gaagtgtggc agcaagtggg cattactttg cctgtggaac gcgtggataa aaaggaactg 18180 

gtggcattca gcgatatgct gggatcgggt cagtctcatc tggtgcgcat ccggcatgat 1824 0 

ggcgttacat gctggcctaa tctggggaac ggcgtgttcg gggcgccgtt ggcccttcac 18300 

gggtttacgg catcggagcg ggaattcaat ccggaacgtg tatatcttgt ggaccttgat 18360 

ggatccggcg cttccgatat catttatgct tctcgtgacg ctctactcat ttaccgaaat 18420 

ctttccggca atggctttgc tgatccggtg cgggttccgc tgcctgacgg cgtgcggttt 18480 

gataatctgt gccggctgct gcctgccgat atccgcgggt taggtgtggc cagtctggtg 1854 0 

ctgcatgtac cttacatggc cccccgcagt tggaaattag atttctttgc ggcgaagccg 18600 

tatttattgc aaacggtcag caacaatctt ggagcttcca gctcgttttg gtaccgaagc 18660 

tccacccagt attggctgga tgagaaacag gcggcctcat cggctgtctc cgctttgccc 1872 0 

ttcccgataa acgtggtatc ggatatgcac acggtggacg aaatcagcgg ccgcaccagg 18780 

actcagaagt atacttaccg ccatggcgtg tatgaccgga ccgaaaagga atttgccgga 18840 

ttcggccgca ttgacacatg ggaagaggag cgggattccg aaggaaccct gagcgtcagc 18900 

actccgcccg tgctgacgcg gacctggtat cataccgggc aaaagcagga tgaggagcgt 18960 

gccgtgcagc aatattggca aggcgaccct gcggcttttc aggttaaacc cgtccggctt 19020 

actcgattcg atgcggcagc ggcccaggat ctgccgctag attctaataa tgggcagcaa 19080 

gaatactggc tgtaccgatc attacaaggg atgccgctgc ggactgagat ttttgcggga 19140 

gatgttggcg ggtcgcctcc ttatcaggta gagagcttcc gttatcaagt gcgcttggtg 19200 

cagagcatcg attcggaatg tgttgccttg cccatgcagt tggagcagct tacgtacaac 19260 

tatgagcaaa tcgcctctga tccgcagtgt tcacagcaga tacagcaatg gttcgacgaa 19320 
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tacggcgtgg cggcacagag tgtaacaatc caatatccgc gccgggcaca gccggaggac 19380 

aatccgtacc ctcgcacgct gccggatacc agctggagca gcagttatga ttcgcagcaa 19440 

atgctgctgc ggttgaccag gcaaaggcaa aaagcgtacc accttgcaga tcctgaaggc 19500 

tggcgcttga atattcccca tcagacacgc ctggatgcct tcatttattc tgctgacagc 19560 

gtgcccgccg aaggaataag cgccgagctg ctggaggtgg acggcacgtt acgatcttcg 19620 

gcgctggaac aggcttatgg cggccagtca gagatcatct atgcgggcgg gggcgaaccg 1968 0 

gatttgcgag ccctggtcca ttacaccaga agcgcggttc ttgatgaaga ctgtttacaa 1974 0 

gcctatgaag gcgtactgag cgatagccaa ttgaactcgc ttcttgcctc ttccggctat 198 00 

caacgaagcg caagaatatt gggttcgggc gatgaagtgg atatttttgt cgcggaacaa 19860 

ggatttaccc gttatgcgga tgaaccgaat tttttccgta ttctggggca acaatcctct 1992 0 

ctcttgtccg gggaacaagt attaacatgg gatgataatt tctgtgcggt tacatccatc 19980 

gaagacgcgc ttggcaatca aattcagatt gcatatgatt accgctttgt ggaggccatc 2004 0 

cagattaccg atacgaataa taatgtgaat caggtcgccc tggatgctct cggccgggtc 20100 

gtatacagcc ggacctgggg cacggaggaa gggataaaga ccggcttccg cccggaggtg 20160 

gaattcgcga cgcccgagac aatggagcag gcgcttgccc tggcatctcc cttgccggtt 20220 < 

gcatcctgct gtgtatatga tgcgcatagc tggatgggaa cgataactct tgcacaactg 20280 

tcagagcttg ttccagatag tgaaaagcaa tggtcgttct tgatagacaa tcgcttgatt 20340 

atgccggacg gcagaatcag atcccgcggt cgggatccat ggtcgcttca ccggctattg 20400 

ccgcctgctg tgggcgaatt gctgagcgag gcggaccgta aaccgccgca tacggtaatt 20460 

ttggcagcag atcgttaccc ggatgaccca tcccagcaaa ttcaggcgag catcgtgttt 2052 0 

agcgatggct ttgggcgtac gatacaaact gctaaaagag aagatacccg atgggcgatt 20580 

gcggaacggg tggactatga cggaaccgga gccgtaatcc gcagctttca gcctttttat 20640 

cttgacgact ggaattatgt gggcgaagag gctgtcagca gctctatgta cgcaacgatc 20700 

tattattatg atgctctggc acgacaatta aggatggtca acgctaaagg atatgagagg 20760 

agaactgctt tttacccatg gtttacagta aacgaagatg aaaatgatac catggactca 20820 

tcattatttg cttcaccgcc tgcgcggtga gatggaggtt aaacctatga acacaacgtc 2 088 0 

catatatagg ggcacgccta cgatttcagt tgtggataac cggaacttgg agattcgcat 20940 

tcttcagtat aaccgtatcg cggctgaaga tccggcagat gagtgtatcc tgcggaacac 21000 

gtatacgccg ttaagctatc ttggcagcag catggatccc cgtttgttct cgcaatatca 21060 

ggatgatcgc ggaacaccgc cgaatatacg aaccatggct tccctgagag gcgaagcgct 21120 
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gtgttcggaa agtgtggatg ccggccgcaa ggcggagctt tttgatatcg aggggcggcc 21180 

cgtctggctt atcgatgcca acggcacaga gacgactctc gaatatgatg tcttaggcag 21240 

gccaacagcc gtattcgagc aacaggaagg tacggactcc ccccagtgca gggagcggtt 21300 

tatttatggt gagaaggagg cggatgccca ggccaacaat ttgcgcggac aactggttcg 21360 

ccactacgat accgcgggcc ggatacagac cgacagcatc tccttggctg gactgccgtt 21420 

gcgccaaagc cgtcaactgc tgaaaaattg ggatgaacct ggcgactgga gtatggatga 21480 

ggaaagcgcc tgggcctcgt tgctggctgc cgaagcttat gatacgagct ggcggtatga 2154 0 

cgcgcaggac agggtgctcg cccaaaccga cgccaaaggg aatctccagc aactgactta 21600 

caatgacgcc ggccagccgc aggcggtcag cctcaagctg caaggccaag cggagcaacg 21660 

gatttggaac cggatcgagt acaacgcggc gggtcaagtg gatctcgccg aagccgggaa 21720 

tggaatcgta acggaatata cttacgagga aagcacgcag cggttaatcc gaaaaaaaga 21780 

ttcccgcgga ctgtcctccg gggaaagaga agtgctgcag gattatcgtt atgaatatga 21840 

tccggtaggc aatatccttt ctatttacaa tgaagcggag ccggttcgtt atttccgcaa 21900 

tcaggccgtt gctccgaaaa ggcaatatgc ctacgatgcc ttgtatcagc ttgtatctag 21960 

ttcggggcgg gaatccgacg cgcttcggca gcagacgtcg cttcctccct tgatcacgcc 22020 

tatccctctg gacgatagcc aatacgtcaa ttacgctgaa aaatacagct atgatcaggc 22080 

gggcaattta atcaagctta gccataacgg ggcaagtcaa tatacaacga atgtgtatgt 22140 

ggacaaaagc tcaaaccggg ggatttggcg gcaaggggaa gacatcccgg atatcgcggc 22200 

ttcctttgac agagcaggca atcaacaagc tttattcccg gggagaccgt tggaatggga 22260 

tacacgcaat caattaagcc gtgtccatat ggtcgtgcgc gaaggcggag acaacgactg 2232 0 

ggaaggctat ctctatgaca gctcgggaat gcgtatcgta aaacgatcta cccgcaaaac 22380 

acagacaacg acgcaaacgg atacgaccct ctatttgccg ggcctggagc tgcgaatccg 2244 0 

ccagaccggg gaccgggtca cggaagcatt gcaggtcatt accgtggatg agggagcggg 22500 

acaagtgagg gtgctgcact gggaggatgg aaccgagccg ggcggcatcg ccaatgatca 22560 

gtaccggtac agcctgaacg atcatcttac ctcctcttta ttggaagttg acgggcaagg 22620 

tcagatcatt agtaaggaag aattttatcc ctatggcggc acagccctgt ggacagcccg 22680 

gtcagaggta gaggcaagct acaagaccat ccgctattca ggcaaagagc gggatgccac 22740 

aggcctgtat tattacggac accgctacta tatgccatgg ttgggtcgct ggctgaatcc 22800 

ggacccggcc ggaatggtag atggactaaa cctgtaccgt atggtcagga acaatcctat 22860 

aggactgatg gatccgaatg ggaatgcgcc aatcaacgtg gcggattata gcttcgtgca 22920 

tggtgattta gtttatggtc ttagtaagga aagaggaaga tatctaaagc tatttaatcc 22980 
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aaactttaat 


atggaaaaat 


cagactctcc 


tgctatggtt 


atagatcaat 


ataataataa 


23040 


tgttgcattg 


agtataacta 


accaatataa 


agtagaagaa 


ttgatgaaat 


ttcaaaaaga 


23100 


cccacaaaaa 


gccgcacgga 


aaataaaggt 


tccagaaggg 


aatcgtttat 


cgaggaacga 


23160 


aaattatcct 


ttgtggcacg 


attatattaa 


cattggagaa 


gctaaagctg 


catttaaggc 


23220 


ctctcatatt 


ttccaagaag 


tgaaggggaa 


ttatgggaaa 


gattattatc 


ataaattatt 


23280 


attagacaga 


atgatagaat 


cgccgttgct 


gtggaaacga 


ggcagcaaac 


tcgggctaga 


23340 


aatcgccgct 


accaatcaga 


gaacaaaaat 


acactttgtt 


cttgacaatt 


taaatatcga 


23400 


gcaggtggtt 


acgaaagagg 


gtagcggcgg 


tcagtcaatc 


acagcttcgg 


agctccgtta 


23460 


tatttatcga 


aatcgcgaaa 


gattgaacgg 


gcgtgtcatt 


ttctatagaa 


ataatgaaag 


23520 


gctagatcag 


gctccatggc 


aagaaaatcc 


ggacttatgg 


agcaaatatc 


aaccgggtct 


23580 


tagacaaagc 


agcagttcaa 


gagtcaaaga 


acgagggatt 


gggaactttt 


tccgccggtt 


23640 


ttcaatgaag 


agaaagtagc 


atgtaactaa 


aattgctccc 


cattggttgt 


gtaaactaat 


23700 


ggggagttgt 


gattcactcc 


tgttcaacgc 


cattcatgta 


gaattgtttt 


gggaggttaa 


23760 


accgattgga 


tgccggcccc 


aaggcggagc 


tttttgatac 


cgaggggctt 


cgagtgtggc 


23820 


ttatcgatat 


caacggcaca 


cagacgactc 


tcgaatatga 


tgtattaggc 


aggcctgcag 


23880 


ccgtattcga 


gcatcaggaa 


ggcaaggaat 


ttcctaagtg 


ccgggatcgg 


tttagaatga 


23940 


gtctgatgcc 


aagccaacaa 


tttgcgaggg 


cagttgatgc 


gccactacga 


tacaatcccg 


24000 


ttacattcct 


tgataataag 


gggcttaaaa 


tcgtaattac 


cctaagtctc 


gtcgaggttg 


24060 


ctatagaatt 


gtatcgtctt 


catggtggcg 


ttcttttgct 


tcataatagt 


acgtgctgct 


24120 


agaattgtgc 


aggacgtcgc 


acattgctga 


tgtaaatgta 


tgtctttttc 


ttggaatagt 


24180 


agatccgctc 


cttgttctga 


tgtgttcatt 


ctactagccc 


ttattttttc 


tggccaacta 


24240 


agtcctatat 


ataattataa 


aaaaagcata 


gatatcttca 


tctataggtg 


aggatatcta 


24300 


tgcttttcat 


tttttgatta 


gagatatact 


tgtagtgcaa 


ggaaaagtag 


ataggagggt 


24360 


gaatttaaca 


gaagttacaa 


actgttgttt 


acttaaaaaa 


ttaatatgga 


gggaaataaa 


24420 


tatgaactca 


aatgaaccaa 


atttatctga 


tgttgttaat 


tgtttaagtg 


accccaatag 


24480 


tgacttggag 


aagtctggcg 


gtggagtagc 


gctagatgtt 


ggaatgtcat 


tgatatccga 


24540 


acttcttggt 


acggttccag 


ttgctggatc 


aattcttcaa 


tttgtattcg 


ataaattgtg 


24600 


gtttattttt 


ggcccttctg 


agtgggactc 


acttatggaa 


catgttgaag 


cattaattga 


24660 


tagtaaaata 


caagagcagg 


taaaaagaag 


tgcacaagat 


gaactaaatg 


caattacaaa 


24720 


taacttatct 


acgtatttga 


aatttctaga 


tgcatgggaa 


aatgattcta 


ataatttaag 


24780 
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agcgagagct gtagtgaaag accaatttgt aggccttgaa cagactcttg aaagaaaaat 24840 
ggttagtgtt tttggaagta cgggtcatga agtgcatctt ttgccaattt tcgctcaagc 24900 
agccaacctc cacctaattc tattaagaga tgctgagaaa tatggaaaga gatggggttg 24960 
ggcagataga gaaattcaag tatattatga taaccagatt cgttatatcc atgaatatac 25020 
ggaccattgt attaaatatt ataatcaagg attaagtaaa ctgaaaggtt ctacctatca 25080 
agattgggat aagtataatc gttttagaag agaaatgacc ctaactgttc ttgatttgat 2514 0 
ttcaattttc ccatcgtatg atactagaac ttacccaatt gatacaatag gtcaattgac 25200 
aagggaagtt tattcggatt tacttattgc taacccgtct gggatgcaga ctttcactaa 25260 
tgtagatttc gacaatattc ttattagaaa acctcattta atggatttct taagaactct 25320 
tgagattttt accgatcgac ataacgcaag cagacacaac gtatattggg gcggacatcg 25380 
agtgcattct tcttacacag gaggtaattt tgaaaatttt gaatctccct tatatggcag 2544 0 
tgaagcaaat gtagaacccc gaacatggtt gagttttgga gaatctcaag tctataatat 25500 
acgttcgaag cctgagtggg atagaggaag tactgcaatt agtggctcct atgaatttcg 25560 
aggagtgaca ggatgttctt tttatcgaat gggaaatttt gctggcaccg tagccctaac 25620 
ttaccgacag tttggtaacg aaggttctca aatcccattg cacaggctat gtcatgttac 25680 
ttattttaga agatctcaag ctgtgggggc gacttcgaga cagacgttaa caagtggtcc 25740 
gctattttcc tggacacata gtagtgctac ggaaacgaat atcattcacc cgacaaaaat 25800 
tacacaaata ccaatggtga aggctagttc ccttggatca ggtacttctg ttgtccaagg 25860 
accaggcttt acaggagggg atgtacttcg aagaaatagc cccggtagca caggaacttt 25920 
aagagttaac gtcaattcac cattatcaca gagatatcgt ataagaattc gttacgcttc 25980 
tactacggat ttagattttt ttgtcattcg cggaaatacg acagttaata attttagatt 26040 
tgggaacact atgcgtaaag gagaccctat aacctctcga tcatttagat ttgcggcttt 26100 
tagtacacca tttacttttg ctagctcaca ggatgaactt agaataaatg tacaaaattt 26160 
caataatggt gaagaagttt atatagatag aatcgaagtt attccagttt gatactacag 26220 
atgtttatgt tgatacccta ggtaatacac ctactgagac gttgggaaga agtgtaaaaa 26280 
actcatctca aaatcgaaag taaaagagcc cttcttaaaa ttttaagagg ggctcagctc 2634 0 
attgttgcac agttactcca gggtctgtca aggattttgc ttaaattgat ttatatgaac 26400 
ctaccccgag ggggagacga gcctcaaggc aaaatgctgg ccgatgcgtt ctggaaagaa 26460 
tacggagaac tgcagcagca ttttcggaca gcaggcggaa atagcctgag agaagacgat 2652 0 
gaggttgtcg acccaggtga tcaggatgtc ctggactccc cgattcttaa gatcgttcag 26580 
aatgctgagc cagaacttcg tggactcgtt ttccccgatc cacatgctca gtacgacctt 26640 
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attgccgtcc aaatcgatgc cgtacacatt ctggagatga tcttcaatct cccgtgtact 26700 

tacgtcctct tggcgtagag ggcgaatatc agtcctcgat gccggttacg ctcgtttgat 26760 

tcttcttcac aacgatgggc tcgaacttga ttaggcggtc tcgggaacgg aaatcgcctg 26820 

tacgccgtac tcacctggtg atcgtcttct tgctcttacc attgcggcta tttgccgtct 268B0 

gctggttctg cacatcatac ctctcataac ccagatgcgt gtcctccccc aacctttcta 2694 0 

gtcctttcat tgttagtgga tttgattgag tttacacgaa gtatttacga ctcttggaac 27000 

ougygtgttt tttgcaagtg aaaaatgctg agttttgctg caaaaagtgc caggctccct 27060 

tgccagccaa aacattactg cggaatgcgt tccattgcct gcttgaggcg gaaactctcg 27120 

cctgtgaacg agagaatatg cgcgtggtgc acgagacggt cgaccaaggc tgaggtgagc 27180 

ttagtatccc caaatatgga ggtccattgg ccaaactcca ggtactgggc tgtgggtttt 2724 0 

gcgcttgcga agggcggcgt tccagtcgct ctggtcggca tagcgtttgg ccgttcgcca 27300 

gtgaatgccc acctaccttg cgatttcgct cactgagcag ccttctgttt cgcgtaaaaa 27360 

agttgttgag gcattttcag catccttccc gtctccttcg tcaagttctc ggcaaaccta 27420 

acgataggag gattttaagg ggctgacaag tgcctttttt gcatctgact tcagcatttt 27480 

tacgctgcaa ttctaggcat ttttagtatg caataaacac ccgggtctcc atctctgaca 27540 

tgagcctgcc gggcactcat ataatataaa aagcatcaga gctgaatgag tcggctgctg 27600 

atgcttattt attggaccga acgatatctc tgcaagtaca agtacgaatg gagcccggta 27660 

ttgttgatac ccccgttttt ataaggatag actaagcagt tctttaatag actttaaatc 27720 

tttcaggatg agtttatctg gtatccaaat gacaggaatt tgagttttat tgattttgta 27780 

attggttata atgaaatcga atttgaccgt attgcttggt tcggtttgaa ggtttagaac 27840 

gttacccaat tcattattta atttgtgata gatgaatttt ttccaattgg ttggtccgga 27900 

taaaagcaat aaaacctttt ttttataatt tgacaggaga atagataaaa tatgaaacgt 27960 

gatgtggtag ctctcatctt cagataatct attttcattt atttctttat tccatgcgac 28020 

aataagaggg tgaatgatgt tgtagccctc tattaattca ttaggaatcg tcttaaatct 28080 

accccaatag tattctaacg gtaaattgag atttttaaat atatagatct tgtggaaaga 28140 

aaaatggatg ttatgtctta tttcttctat tactttggca ttcatattta cttcacttga 28200 

aatcatgtca attaaagcaa ggtgcttatc gtaaagttgt ttatatgatc cacgatagta 28260 

attcttgatt ttactttttt cattatcatc cccctcggag atgaggaaaa aaaatagtgt 28320 

aaaaaactta agttcatttt gctggaagct gaccttatag aaattttcaa tggcagagac 28380 

aatggcatag gtgcaatcta acttttgaga gctaaatatg tagtccatat taaaactttt 28440 
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ggaaataaaa tgcttgtata tatttctatt tacagacaca ttgatgagta tggttaatag 28500 

attataatcg atctccattt tatagaaacg aatatagttc tcaatttttt gttgcaagtc 28560 

taggtccggt ttaaagattt ctttatactt ttgcacgtga aaaaaaagaa ccatatagaa 28620 

gtatctaata ttcagttcct ttccaattat ttggatcgtt gttctgaaat taaaatcaag 28680 

gttaaaatgt ctaagtattt ttctgaaatt atttaatttt ttcttcaatg ttaatttatc 28740 

taagtacaat atttgagacc atttttccaa gctataatac ttctgattga atattcccat 28800 

taaaatgatg taaagttcac tgtttattat ataagtagct ataatagtgg aaaggtgatc 28860 

taaaggatct ttttttagaa cgtaaccttt ggaattaatg cttattatat cccaattatc 28920 

tggaagatcc tgttttaatt gagaaatgtc acttataatc gttctgcttg tacattgcag 28980 

cttacgggct aaagaactgg aagacacgat accttcgctg tcaattaagg attctaatat 2904 0 

ttggattttt cgaataattg aatgattttg aattaagtga gcggtgaatt catccattta 29100 

gtctccccct cgaaaagcga taactggtta aagtgcagat gtactatcct ttgttcgaag 29160 

ttgctagtaa tcaattctgc ggatctcttc aaagtgctcc aggtgctatg gatttacgaa 29220 

tttcccctaa acatatcttt ctaatgtttg tcttcttaat gaaaatcttt attatttttc 292 80 

tccttttcac aatttaaatt tcaattagaa agtaacgtaa ttttggaatg aaaaacgata 29340 

cataatttca ttaatagatg aaaatatgta tggtttttaa aatgtttgta aaggtcagct 294 00 

taatgcggtt aatagaaggt taagagattt tattgtgcaa ggggggacga agaaggatat 29460 

aagatagaac atatccgaaa gggaggaggc taacgaaatt tgaaattatt tgttgggtag 29520 

tgtttgcaaa actcgaatta tatggtaatc tatagatggt caattatttg caatgccaaa 29580 

ggaaggtgag gctgtgagta ccgttgctgc gaagccacag aggttggggg agctaataca 29640 

gt acta t egg cagaagaagg aattgagtct gtcgaagctg caagaagegg teggcattga 29700 

taaaggcagc ctgtcgagaa ttgaaaacag cgaggtcaaa cgccctgatt ttcgatccat 29760 

cttgtcgatc gecgeggtat tggacattcc ccatgacgcc ategtagaac agtacatcga 2982 0 

gateggacat aaatcggaag tcatatacac tattttacag aacgaattga caaccctcga 2988 0 

gcatccatcg ctcataccga aaattgeege aaagtttctt gaagcgccca atgaagacag 29940 

tetggatgea gtagagaagt tgtaccgaac gataggctcc gtgaatcatc cttccactca 30000 

gttatcttta tacaccctca tegtagacta ctcacgcgcc catggaatca tgecttacat 30060 

tgccaaagga ttatttcgga agtacatgat tgaacgaaat gatttcagca ggttgaagga 3012 0 

aacgtatcag gttggaaaaa atgtgctgga etatgetaat tttttgagcg agaaagagcg 30180 

gatacttcta tactatggct tgagtgttca cgcttatagt cttatgtttt ateatgatge 3024 0 

cataaaattc ggtaattatg ttgtggaaaa tggggaagaa gaacegctag caaatgeaac 30300 



X 
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tcataatgtt 


tgcaatgcct 


attaccattt 


ggggaattat 


gacgattgca 


atacctatct 


30360 


cgaaaagtat 


agtcatttcc 


catatccttt 


cgtcaaagag 


aatgttaaat 


taatgactgc 


30420 


ctttcttaac 


gggaagaaag 


ggaatattga 


gtccgccatt 


actcagttta 


ataactgttt 


30480 


agataccctg 


tcctcatata 


atttgattca 


tgccgtaact 


gaattaatgg 


aactatatct 


30540 


ccataaaaac 


gatcttgttg 


cagcagatca 


actccttatg 


tatgaagaac 


aaataattga 


30600 


gagcatcacc 


caaccacgaa 


ctacgccata 


taaaaggtca 


agattagcgc 


actactttcg 


30660 


catcaaagga 


caaULyutaa 


ct Cy taaaca 


dcatgaaaaa. 


gacgctgttg 


atagtttctt 


30720 


gaaaagtaca 


ttagagtacg 


taaaaattgg 


tcgttttacg 


gaagcttttg 


aatcattgtc 


30780 


gtttgtgacg 


cattcaatga 


tacataatca 


gtcaatcatt 


aatagtgaaa 


taattaaaaa 


30840 


ggttgataat 


attttacaaa 


taattgctgc 


aaaataatta 


aggaggaata 


tggtgccatg 


30900 


agaaaacgta 


aacttctttt 


cattgcctct 


cttctagttt 


ttggagcaat 


cagcatggag 


30960 


catattgttt 


catacatcga 


tgccccgtgg 


ataactaact 


tctaatgaat 


ataacatctc 


31020 


ataacgctag 


tcatggcccc 


gccattcgtt 


ggtggggtaa 


catccttcga 


aagcccgatt 


31080 


cttttactgg 


caatgttgct 


gcaattgtaa 


ggggcctcac 


tcactccaag 


tgctgtctct 


31140 


gcagatcctc 


tttggtttat 


gccatcggtc 


tatagcagca 


gtaactcgat 


cgacagataa 


31200 


aaatttgaaa 


attctcttcg 


aaaggagatt 


gacgcattaa 


gacgccttta 


tgggcgtttt 


31260 


tttgtttttt 


atgtataaag 


ttgttgcata 


ataatccaga 


atcaagtcat 


atttaccaac 


31320 


ttcatctata 


atcaggtcaa 


ataatacagt 


ttatggatga 


ggtgtggaaa 


tgagccatca 


31380 


accggtatac 


aaaggtgact 


taatcccttt 


tacttatagc 


tatgagaaag 


ctggctgttt 


31440 


aattgaaatc 


attgagagca 


ctcgacagga 


gcttatagta 


gcggccacgg 


aaaaaaagtg 


31500 


tcttaccgac 


gaaactgttg 


taagattaag 


ccaaaagtta 


gatacatacc 


tattggaatt 


31560' 


tcaaagaagg 


agctgcggct 


agaagaatag 


gatgtgataa 


tccggattat 


taggggaagg 


31620 


tgaggctgtg 


agtatcggtt 


ctgtgaagcc 


taaaagcttg 


ggggagttaa 


tcaagtatta 


31680 


tcgaaaaatg 


aatagcgaaa 


tcgcagatca 


gacattgaga 


ttgagcaaaa 


acctaaagtc 


31740 


attttctcaa 


tct tgcagaa 


ccraattaaaa 


acacfccgagc 


atccagcatt 


aatacccgaa 


31800 


atagccgcca 


aatttcttga 


cttatcaaat 


ggtatggagg 


gggtaaaaga 


gctttacaga 


31860 


gtaattgact 


cggtagaaga 


tacctccatt 


caattagctg 


tgtacaacct 


tattattgat 


31920 


tactctcgcg 


ctcatggaat 


gatgccctat 


attgctaagg 


gattataccg 


aaaatacatg 


31980 


atcgaacgaa 


atgatttcag 


caaattgaaa 


gaaacctatc 


aactaggtaa 


gtgtgtattg 


32040 


gactatatac 


agtttctggg 


cgacgaagaa 


cgaattgtat 


tttattattc 


aataggagtt 


32100 
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cacgctgaca gcttaatgaa ctatgatgac tctgtaccct atatgaaata tgtggtggag 32160 

aacgataact cagaaaatgg tgcctataga gcgaatgctt atcttagtct atgcaattct 32220 

tcctataata ctggtgatta caaagcaagc caggaatatt tagatgagta cagtaagtac 32280 

tctttttcat atgtagccga taatgtgcat tttatgtcgg cttgtataga gggcaagatg 32340 

gggaacgtgg atagttcgat ttcaaagtta cgttcttatt tacaaagttc atgggaagta 32400 

tcccgttgaa aagggaacct aaatatattc aaacgccttg tggtgtctca gattgttgaa 32460 

aaaccctgtc cttttctaaa ggtacagggt ttctcacatg aaaaggactc acgaaacgaa 32520 

ccttccgatg aacccggcca cggttttttc gtacgtttcc cgatccatcc ggtatgcttc 32580 

tccatgtccg gctttcggaa cgatgtatag ctctttctcg gcggggcaat tttcgtaaac 3264 0 

tttatgcacc atctccgtcg gcacaaaagt atcgttggcc ccatgaatga agagagtcgg 32700 

ggttttcgac tttttcacct gctccagtgc ggaggcttct ccgaaaaagt accctgcccg 32760 

tagcctggtc agcaggctgg tggtatccac gatcgggaat gccggaagat gatacatgcg 3282 0 

ccgcagctgg aaggaaagct gatccttcac agaggtataa ccacagtctt cgacgatggc 32880 

tttcacgttc ggcggcaaat tctcgccgct ggtcatcatc acggttgccc ctcccatcga 3294 0 

cacgccgtgg agaacgattt gtgaattcgt cccattcgtg tccaaaaccc gctgaatcca 33000 

tttcaaataa tccttacgct cgggccagcc gaaaccgata taatggcctt cgctttcccc 33060 

atgtccccga gcatcgggga gcaggatgtt gtagccccat ttctcgtggt acattctggc 33120 

gtaaccgctc atttgcgttg cgttcccgga atatccgtgc gcgatgatga ccgttttgtc 33180 

cgacggcttg gatgccggca aataatacgc cttcagatga atgccgtcat ccgagtccat 33240 

ctcccagcgc tcaaagcttt ggttgttcca ccattccttg tccgccagcg tggtttgctt 33300 

tgattcctcc acctcaggac tggttttcag gtcgggattg tcgctcaaaa agtctttcga 33360 

agcgcgcgca atcgccactt gataaaagta aaagctcccg gcggtaagga taataatgac 3342 0 

aaatacaatc aaggaaataa agcctgtcac catttttttc ttcaattctg ttctctccta 33480 

gtaaactctc atgttagtta ttttaatata tcataatgat c 33521 

<210> 2 
<211> 3264 
<212> DNA 

<213> Paenibacillus strain IDAS 1529 
<400> 2 

gatcacacgg ccggcgtatt ccggctcgga accgaagaat taacagaagc gcttcagcag 60 

tccggttatc ggacagtctt tgatattgca tctgaaaatc ttgcggaatt tcagaaaagc 120 

aatccggaga ttccctcttc cgacgcgaag gagattcatc aattagccgt ccagaggaca 18 0 
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gaaaacttat 


gcatgcttta 


taaggcctgg 


caactgcaca 


atgatccggt 


cgtccagagc 


240 


cttcccaaat 


tatccgcgga 


taccggcctg 


cgaggcatgc 


gtgccgcgtt 


ggagcggagt 


300 


cttggagggg 


gagccgattt 


tggagacttg 


ttcccggagc 


gatcgccaga 


gggctatgcg 


360 


gaagcctcct 


ctatacagtc 


gcttttttcg 


ccgggacgtt 


accttacggt 


gctgtataaa 


420 


attgcgcagg 


atctccacga 


cccaaaagac 


aaactgcata 


ttgacaaccg 


ccgtccagat 


480 


ttgaagtcgc 


tgatcctcaa 


taatgacaat 


atgaaccgtg 


aggtgtcttc 


cctggatatc 


54 0 


ctgctggatg 


tgctgcagtc 


cgaaggctcc 


ggcacactga 


catccctgaa 


ggatacctac 


600 


tatccgatga 


cccttcccta 


tgatgacgac 


cttgcgcaaa 


tcaatgccgt 


ggcggaggcg 


660 


cgctcatcca 


atttgctggg 


gatctgggat 


accctgctgg 


acacgcagcg 


gacttccatc 


720 


ctgcaggatt 


ccgccgctgt 


ccaccggata 


agcaagccgc 


ggcactcggc 


atacgtcaat 


780 


cagagagtct 


ccgatgatga 


accggtattg 


atcgcgggag 


aggaattcta 


cttggagacc 


640 


ggcggtgttg 


ccgacacgac 


cccgtctccg 


ccaacgaggg 


aagcgctttc 


cttgacgcca 


900 


aacagcttcc 


gtctgctggt 


caaccccgag 


ccgacagcag 


acgacatcgc 


caatcactac 


960 


aacgttaaga 


ctcaagatcc 


tgccgctctg 


gccgccgtct 


taaatgtggt 


cgatgacttt 


1020 


tgcctgaaaa 


ccggtttgag 


ctttaatgag 


ttgctggact 


taacgatgca 


gaaggatgat 


1080 


gaatcgatcg 


gcagcgagta 


caaaagccgg 


tttgtaaaat 


ttggcggcga 


ggccaatgtt 


1140 


ccggtttcaa 


cctatggagc 


tgtatttctg 


acaggaacgg 


aagaaactcc 


gttgtgggta 


1200 


ggaaaaggag 


ctgtgataag 


ccctgcagcg 


gacgcctatg 


ttcgtaatgg 


gacatatgca 


1260 


aacacgaatt 


atggatcaga 


cactagtctt 


gttgtgaagc 


aggatgggtc 


tagtggatac 


1320 


agtagggaag 


catatatcag 


gtttgatttg 


acaggtcttt 


ccggagttgt 


ggaggaagct 


1380 


aaaatttctc 


taacaactag 


agcgaaacaa 


ttgtctagct 


taagacacca 


agctcatttg 


1440 


gtcagtgaca 


acagttggga 


tgaattgaaa 


atcacatgga 


ataacaaacc 


tgcaggagga 


1500 


gcgatcatcg 


caagctggga 


tgttcccgaa 


gttggtgaga 


atgtaaaggt 


tgatgtgacc 


1560 


cggcaagtaa 


atgatgcgct 


cgcaaacggt 


caagataaac 


tatcaattgt 


tattcgttct 


1620 


agtgcaaatt 


atggcagtct 


gggcgatgtc 


tcttatgcct 


ctagagaaca 


ccctgaaaaa 


1680 


gcctcacgac 


cttctatgga 


aatcaaggcg 


ataacgggtg 


ctggtttaaa 


ttttacggcg 


1740 


gataatgttg 


tagctctggc 


aggaagggcg 


gaaaagcttg 


tccggctggc 


gcgcagcacg 


1800 


ggactttcct 


ttgagcagtt 


ggattggctg 


attaccaata 


ccagccgtgc 


cgtaatcgaa 


1860 


catggtggag 


aactgattct 


ggataagccg 


gtactggagt 


ctgtggccga 


attcacaagg 


1920 


ctccataagc 


gttatggcat 


cacagcggat 


atgttcgccg 


cgtttatcgg 


cgaagtcaat 


1980 


acgtatgctg 


aagcaggtaa 


agagagcttt 


tatcagacga 


ttttcagcac 


ggccgaccat 


2040 
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tcggctgcct tacctttagg cgcaactttg caatttgagg tgagcaaaca ggatcgatat 2100 

gaagcgattt gctgcggggc catgggggtg accgccgatg agttctctcg tatcggcaaa 2160 

tactgctttg gcgacaacgc gcagcaagtt accgccaatg aaacaaccgt tgcgcagctt 2220 

tatcgtttag gccgaattcc tcacatgctt ggattgcgtt ttaccgaggc ggagctgttg 22 BO 

tggaaattga tggctggcgg cgaggatacc ttgctccgca cgattggcgc gaagcctcgc 2340 

agtttacaag ccttagagat tattcgccgt actgaggtcc ttttggactg gatggatgct 2400 

catcagcttg atgttgtctc cctgcaagcc atggttacca atcggtacag cggcacagcc 2460 

acgccggagc tgtacaactt tttggcacag gtgcaccaat ccacaagcag tgccgcgaac 2520 

gtgtccaaag cggatgctca ggataccctg cccgcggaca agctgttccg ggccttggcg 2580 

gtaggcttca acctgaaggc caacgtgatg gcgcaggtca ttgactggtt ggacaaaacc 2640 

gacggagcgt ttacgctgcg ggctttctgg gacaagcttc aagcgtattt cagcgccgat 2700 

catgaagaag aactgacggc cttggaagga gaagccgact tgctgcagtg gtgccaacag 2760 

atcagccagt atgcgctcat tgtccgctgg tgcgggttaa gcgatcagga tctggcgctg 2820 

ctgaccgggc atcccgggca gcttctgtcc ggacaacata cggtgccggt accctcgctg 2880 

catctcctgc tggtgctgac ccgcctgaag gaatggcagc agcgcgtcca ggtttccagc 2940 

gaggaggcca tgcgctattt tgcccaggcc gatgcgccaa ccgtcacacg cgatgctgcg 3000 

gtcaagctgc ttgcccgtat ccatggctgg aatgaacagg ataccgcctc gatgaatgac 3060 

tacctgctgg gagagaacga atatcctaag aactttgagc agatctttac tttggaaagc 3120 

tgggtcaacc tgggccgtca actgaatgtt ggcagccgaa cgttgggaga gctggttgac 3180 

atgtcagaag aggatgatac cgcggaaaac acggatttga ttatctcggt cgcccaaagc 3240 

ctgatggctg cggtgcaggc ctga 3264 

<210> 3 
<211> 1087 
<212> PRT 

<213> Paenibacillus strain IDAS 1529 
<400> 3 

Asp His Thr Ala Gly Val Phe Arg Leu Gly Thr Glu Glu Leu Thr Glu 
15 10 15 

Ala Leu Gin Gin Ser Gly Tyr Arg Thr Val Phe Asp lie Ala Ser Glu 
20 25 30 

Asn Leu Ala Glu Phe Gin Lys Ser Asn Pro Glu lie Pro Ser Ser Asp 
35 40 45 
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Ala Lys Glu He His Gin Leu Ala Val Gin Arg Thr Glu Asn Leu Cys 
50 55 60 



Met Leu Tyr Lys Ala Trp Gin Leu His Asn Asp Pro Val Val Gin Ser 
65 * " 70 75 80 

Leu Pro Lys Leu Ser Ala Asp Thr Gly Leu Arg Gly Met Arg Ala Ala 
85 90 95 



Leu Glu Arg Ser Leu Gly Gly Gly Ala Asp Phe Gly Asp Leu Phe Pro 
100 105 HO 



Glu Arg Ser Pro Glu Gly Tyr Ala Glu Ala Ser Ser He Gin Ser Leu 
115 120 125 

Phe Ser Pro Gly Arg Tyr Leu Thr Val Leu Tyr Lys He Ala Gin Asp 
130 135 140 



Leu His Asp Pro Lys Asp Lys Leu His He Asp Asn Arg Arg Pro Asp 
145 " 150 155 160 

Leu Lys Ser Leu He Leu Asn Asn Asp Asn Met Asn Arg Glu Val Ser 
165 170 175 



Ser Leu Asp He Leu Leu Asp Val Leu Gin Ser Glu Gly Ser Gly Thr 
180 185 190 



Leu Thr Ser Leu Lys Asp Thr Tyr Tyr Pro Met Thr Leu Pro Tyr Asp 
195 200 205 

Asp Asp Leu Ala Gin He Asn Ala Val Ala Glu Ala Arg Ser Ser Asn 
210 215 220 

Leu Leu Gly He Trp Asp Thr Leu Leu Asp Thr Gin Arg Thr Ser He 
225 230 235 240 

Leu Gin Asp Ser Ala Ala Val His Arg He Ser Lys Pro Arg His Ser 
245 250 255 



Ala Tyr Val Asn Gin Arg Val Ser Asp Asp Glu Pro Val Leu He Ala 
260 ~ 265 270 



Gly Glu Glu Phe Tyr Leu Glu Thr Gly Gly Val Ala Asp Thr Thr Pro 
275 280 285 
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Ser Pro Pro Thr Arg Glu Ala Leu Ser Leu Thr Pro Asn Ser Phe Arg 
290 295 300 



Leu Leu Val Asn Pro Glu Pro Thr Ala Asp Asp lie Ala Asn His Tyr 
305 310 315 320 



Asn Val Lys Thr Gin Asp Pro Ala Ala Leu Ala Ala Val Leu Asn Val 
325 330 335 



Val Asp Asp Phe Cys Leu Lys Thr Gly Leu Ser Phe Asn Glu Leu Leu 
340 345 350 



Asp Leu Thr Met Gin Lys Asp Asp Glu Ser lie Gly Ser Glu Tyr Lys 
355 360 365 



Ser Arg Phe Val Lys Phe Gly Gly Glu Ala Asn Val Pro Val Ser Thr 
370 375 380 



Tyr Gly Ala Val Phe Leu Thr Gly Thr Glu Glu Thr Pro Leu Trp Val 
385 390 395 400 



Gly Lys Gly Ala Val lie Ser Pro Ala Ala Asp Ala Tyr Val Arg Asn 
405 410 415 



Gly Thr Tyr Ala Asn Thr Asn Tyr Gly Ser Asp Thr Ser Leu Val Val 
420 425 ' 430 



Lys Gin Asp Gly Ser Ser Gly Tyr Ser Arg Glu Ala Tyr lie Arg Phe 
435 440 445 



Asp Leu Thr Gly Leu Ser Gly Val Val Glu Glu Ala Lys lie Ser Leu 
450 455 460 



Thr Thr Arg Ala Lys Gin Leu Ser Ser Leu Arg His Gin Ala His Leu 
465 470 475 480 



Val Ser Asp Asn Ser Trp Asp Glu Leu Lys lie Thr Trp Asn Asn Lys 
485 490 495 



Pro Ala Gly Gly Ala lie He Ala Ser Trp Asp Val Pro Glu Val Gly 
500 505 510 



Glu Asn Val Lys Val Asp Val Thr Arg Gin Val Asn Asp Ala Leu Ala 
515 520 525 
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Asn Gly Gin Asp Lys Leu Ser lie Val lie Arg Ser Ser Ala Asn Tyr 
530 535 540 



Gly Ser Leu Gly Asp Val Ser Tyr Ala Ser Arg Glu His Pro Glu Lys 
545 550 555 560 



Ala Ser Arg Pro Ser Met Glu He Lys Ala He Thr Gly Ala Gly Leu 
565 570 575 



Asn Phe Thr Ala Asp Asn Val Val Ala Leu Ala Gly Arg Ala Glu Lys 
580 5B5 590 



Leu Val Arg Leu Ala Arg Ser Thr Gly Leu Ser Phe Glu Gin Leu Asp 
595 600 605 



Trp Leu He Thr Asn Thr Ser Arg Ala Val He Glu His Gly Gly Glu 
610 615 620 



Leu He Leu Asp Lys Pro Val Leu Glu Ser Val Ala Glu Phe Thr Arg 
625 630 635 640 



Leu His Lys Arg Tyr Gly He Thr Ala Asp Met Phe Ala Ala Phe He 
645 650 655 



Gly Glu Val Asn Thr Tyr Ala Glu Ala Gly Lys Glu Ser Phe Tyr Gin 
660 665 670 



Thr He Phe Ser Thr Ala Asp His Ser Ala Ala Leu Pro Leu Gly Ala 
675 680 685 



Thr Leu Gin Phe Glu Val Ser Lys Gin Asp Arg Tyr Glu Ala He Cys 
690 695 700 



Cys Gly Ala Met Gly Val Thr Ala Asp Glu Phe Ser Arg He Gly Lys 
705 710 715 720 



Tyr Cys Phe Gly Asp Asn Ala Gin Gin Val Thr Ala Asn Glu Thr Thr 
725 730 735 



Val Ala Gin Leu Tyr Arg Leu Gly Arg He Pro His Met Leu Gly Leu 
740 " 745 750 



Arg Phe Thr Glu Ala Glu Leu Leu Trp Lys Leu Met Ala Gly Gly Glu 
755 760 765 



Asp Thr Leu Leu Arg Thr He Gly Ala Lys Pro Arg Ser Leu Gin Ala 
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770 775 780 



Leu Glu lie lie Arg Arg Thr Glu Val Leu Leu Asp Trp Met Asp Ala 
785 790 795 800 



His Gin Leu Asp Val Val Ser Leu Gin Ala Met Val Thr Asn Arg Tyr 
805 810 815 



Ser Gly Thr Ala Thr Pro Glu Leu Tyr Asn Phe Leu Ala Gin Val His 
820 825 830 



Gin Ser Thr Ser Ser Ala Ma Asn Val Ser Lys Ala Asp Ala Gin Asp 
835 840 845 



Thr Leu Pro Ala Asp Lys Leu Phe Arg Ala Leu Ala Val Gly Phe Asn 
850 855 860 



Leu Lys Ala Asn Val Met Ala Gin Val He Asp Trp Leu Asp Lys Thr 
865 870 875 880 



Asp Gly Ala Phe Thr Leu Arg Ala Phe Trp Asp Lys Leu Gin Ala Tyr 
885 890 895 



Phe Ser Ala Asp His Glu Glu Glu Leu Thr Ala Leu Glu Gly Glu Ala 
300 905 910 



Asp Leu Leu Gin Trp Cys Gin Gin He Ser Gin Tyr Ala Leu He Val 
915 920 925 



Arg Trp Cys Gly Leu Ser Asp Gin Asp Leu Ala Leu Leu Thr Gly His 
930 935 940 



Pro Gly Gin Leu Leu Ser Gly Gin His Thr Val Pro Val Pro Ser Leu 
945 950 955 960 



His Leu Leu Leu Val Leu Thr Arg Leu Lys Glu Trp Gin Gin Arg Val 
965 970 975 



Gin Val Ser Ser Glu Glu Ala Met Arg Tyr Phe Ala Gin Ala Asp Ala 
980 985 990 

Pro Thr Val Thr Arg Asp Ala Ala Val Lys Leu Leu Ala Arg He His 
995 1000 1005 



Gly Trp Asn Glu Gin Asp Thr Ala Ser Met Asn Asp Tyr Leu Leu 
1010 1015 1020 
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Gly Glu Abu Glu Tyr Pro Lys Asn Phe Glu Gin He Phe Thr Leu 
1025 1030 1035 

Glu Ser Trp Val Asn Leu Gly Arg Gin Leu Asn Val Gly Ser Arg 
1040 1045 1050 

Thr Leu Gly Glu Leu Val Asp Met Ser Glu Glu Asp Asp Thr Ala 
1055 1060 1065 

Glu Asn Thr Asp Leu He He Ser Val Ala Gin Ser Leu Met Ala 
1070 1075 1080 

Ala Val Gin Ala 
1085 

<210> 4 

<211> 3618 

<212> DNA 

<213> Paenibacillus strain IDAS 1529 



<400> 4 



atgaccaagg 


aaggtggtaa 


gaatatgtct 


acttcaaccc 


tgttgcaatt 


gattaaggaa 


60 


tcccgccggg 


atgcgttggt 


caaccattat 


atcgccaaca 


atgtcccgag 


agagcttacg 


120 

i 


gataagatta 


cagacgcaga 


cagcctgtat 


gagtatttgc 


tgctggatac 


caagatcagt 


180 


gaactcgtaa 


aaacatcgcc 


gatagctgag 


gccattagca 


gcgttcagtt 


atacatgaac 


240 


cgatgcgtgg 


aaggctatga 


aggcaagctg 


actccggaag 


gcaacagcca 


tttcgggccg 


300 


ggaaaattcc 


tgaataattg 


ggatacctat 


aacaagcgtt 


attccacttg 


ggccggcaag 


360 


gaacgtctga 


aatattatgc 


aggcagttat 


attgacccgt 


ccttgcgcta 


taacaaaacg 


420 


gatccgttcc 


tgaacctgga 


acagaatatc 


agccagggaa 


gaatcaccga 


tgacaccgta 


480 


aagaacgcgc 


tgcaacacta 


cctgactgaa 


tatgaagtgt 


tggcggattt 


ggaatatatc 


540 


agcgtaaata 


aaggcgccga 


tgaaagtgta 


ttattcttcg 


taggccgcac 


caaaacaatg 


600 


ccatacgaat 


attactggcg 


ccgattaacg 


ttgaaaaagg 


acaataacaa 


taaactggtg 


660 


cctgccatct 


ggtctcaatg 


gaaaaaaata 


actgccaata 


tcggcgaagc 


agttaataat 


720 


tatgtggtgc 


ttcactggca 


taataaccgc 


ttacatgtac 


aatggggttc 


tacagagaaa 


780 


acacaaaatg 


atgacggaga 


acccattgag 


aaacgatatt 


tgaatgactg 


gttcatggat 


84 0 


aagtccagtg 


tctggtcttc 


attccgaaag 


gtttcatata 


tagaaaatag 


ttttacttat 


900 


actgagggca 


tcattgattc 


aagaaatatt 


actatagctg 


gaaatcaact 


gttctgtgat 


960 


gattcaaata 


cttttaaggc 


aacaataacg 


gcacttccat 


ttgaccaaat 


acgtgtttac 


1020 
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ttagaaaaga 


tttacggtac 


aggcggcagc 


atcacggtta 


ctggagaaaa 


taaaggctat 


1080 


attattaagg 


tgggggagcc 


aagagaagtc 


agtttctctc 


ctaatacgtt 


actagatgta 


1140 


ttcataggta 


gtaatgcaag 


ccctcgagac 


ccatatttca 


aagctacatt 


taatagagaa 


1200 


gctctccaaa 


attcatacgg 


ctcaattaaa 


ataaatcaat 


acacccctcc 


ttctggaagc 


1260 


aatatcaaag 


gtcctatcga 


ccttaccctg 


aaaaataaca 


tcgacctgtc 


ggcgttgttg 


1320 


gaagagagcc 


ttgacgtact 


gttcgactat 


accattcagg 


ggaataacca 


attgggcggc 


1380 


ttagaggcct 


ttaacgggcc 


ttacggactt 


tatttgtggg 


aaatcttcct 


ccatgttcca 


1440 


tttttaatgg 


cggttcgctt 


ccacaccgag 


cagcggtatg 


agttggcgga 


acgatggttt 


1500 


aaattcattt 


tcaacagcgc 


aggttaccgt 


gatggctacg 


gcaatctgct 


gacggatgac 


1560 


aaaggcaacg 


tgcgctactg 


gaacgtcgtg 


cctctgcagg 


aggatacgga 


gtgggatgac 


1620 


acgttgtccc 


tggcaacgac 


cgacccggac 


gagattgcga 


tggccgaccc 


gatgcaatac 


1680 


aagctggcta 


tctttattca 


caccttggac 


ttcttgatca 


gccgcggcga 


cagcttgtac 


1740 


cggatgctgg 


agcgggatac 


cttgaccgaa 


gcgaagatgt 


attacattca 


ggccagccaa 


1B00 


ctgcttgggc 


ctcgtcccga 


gatccggatc 


aatcacagct 


ggcctgatcc 


gaccctgcaa 


1860 


agcgaagcgg 


acgcggtaac 


cgccgtgccg 


acgcgaagcg 


attcgccggc 


agcgccaatt 


192 0 


ctcgccttgc 


gagcgcttct 


gaatgcggaa 


aacgggcatt 


tcctgccgcc 


ttataatgat 


1980 


gaactattag 


ctttctggga 


taaaatcgac 


ctgcgtctct 


acaatttacg 


ccacaatctg 


2040 


agcctggacg 


gtcagccgct 


tcatttgccg 


ctctttaccg 


aaccggtcaa 


tcctcgtgag 


2100 


ctgcaggttc 


agcatggggc 


aggcgatgga 


ttagggggaa 


gcgccggttc 


cgtccaaagc 


2160 


cgtcaaagtg 


tctatcgttt 


tcctctggtc 


atcgataagg 


cgcgcaatgc 


cgcgagtagt 


2220 


gttatccaat 


tcgggaatgc 


cctggaaaac 


gcgctgacaa 


agcaggacag 


cgaggccatg 


2280 


actatgctgt 


tgcaatccca 


gcagcagatt 


gtcctgcagc 


aaacccgcga 


tattcaggag 


2340 


aagaacctgg 


cctcgctgca 


agcaagtctg 


gaagcaacga 


tgacagccaa 


agcgggcgcg 


2400 


aaatcccgaa 


agacccattt 


tgccggcctg 


gcggataact 


ggatgtcgca 


taatgaaacc 


2460 


gcctcacttg 


cactgcgtac 


cactgcggga 


attatcaata 


caagctcgac 


cgtgccaatc 


2520 


gctatcactg 


gcggcttgga 


tatggctccg 


aacatttttg 


gtttcgcagt 


tggaggttcc 


2580 


cgctggggag 


cagccagcgc 


ggctgtagcc 


caaggattgc 


aaatcgccgc 


cggcgtaatg 


2640 


gaacagacgg 


ccaatatcat 


cgatatcagc 


gaaagctacc 


gccggcgccg 


ggaggattgg 


2700 


ctgctgcagc 


gggatgttgc 


cgagaatgaa 


gcggcgcagt 


tggattcgca 


gattgcggcc 


2760 


ctgcgggaac 


agatggatat 


ggcgcgaaaa 


caacttgcgc 


tggcggagac 


ggaacaggca 


2820 
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cacgcgcaag 


cggtctacga 


gctgctaagc 


28 

acccgtttta cgaatcaagc tttgtataac 


2880 


tggatggccg 


gacgtctgtc 


gtctctatac 


tatcaaatgt atgacgccgc attgccgctc 


2940 


tgcttgatgg 


ccaaacaggc 


tttagagaaa 


gaaatcggca atgataaaac ggttggaatc 


3000 


ttcaccctcc 


cggcctggaa 


tgatttgtat 


cagggattgc tagcgggcga ggcgctgctg 


3060 


ctcgagcttc 


agaagctgga 


gaatctgtgg 


ctggaggagg acaagcgcgg aatggaagct 


3120 


gtaagaacgg 


tatctttaga 


tacccttctc 


cgcaaagaaa agccagaatc cggttttgca 


3180 


gatttcg'cca 


aggaagttct 


ggacggaaag 


acgcctgacc rtgtaagcgg agttagcgta 


3240 


cagctgcaaa 


acaatatttt 


cagtgcaacc 


cttgacctgt ccacccttgg cctggatcgc 


3300 


ttttacaacc 


aagcggaaaa 


ggcccacagg 


atcaaaaacc tgtcggttac cttacccgcg 


3360 


ctattgggac 


cttatcagga 


tattgcggca 


accttatcgc taggtggcga gaccgttgcg 


3420 


ctttcccatg 


gcgtggatga 


cagcggcttg 


tttatcacgg atctcaacga cagccgtttc 


3480 


ctgcctttcg 


agggtatgga 


tcctttatcc 


ggcacactcg ttctgtcgat actccatgcc 


3540 


gggcaagacg 


gtgaccagcg 


cctcctgctg 


gaaagcctga acgacgtcat cttccacatt 


3600 


cgatatgtca 


tgaaatag 






3618 



<210> 5 
<211> 1205 
<212> PRT 

<213> Paenibacillus strain IDAS 1529 
<400> 5 

Met Thr Lys Glu Gly Gly Lys Asn Met Ser Thr Ser Thr Leu Leu Gin 
15 10 15 

Leu lie Lys Glu Ser Arg Arg Asp Ala Leu Val Asn His Tyr lie Ala 
20 25 30 

Asn Asn Val Pro Arg Glu Leu Thr Asp Lys lie Thr Asp Ala Asp Ser 
35 40 45 

Leu Tyr Glu Tyr Leu Leu Leu Asp Thr Lys lie Ser Glu Leu Val Lys 
50 55 60 

Thr Ser Pro He Ala Glu Ala He Ser Ser Val Gin Leu Tyr Met Asn 
65 70 75 80 

Arg Cys Val Glu Gly Tyr Glu Gly Lys Leu Thr Pro Glu Gly Asn Ser 
85 90 95 



His Phe Gly Pro Gly Lys Phe Leu Asn Asn Trp Asp Thr Tyr Asn Lys 
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100 



105 



110 



Arg Tyr Ser Thr Trp Ala Gly Lys Glu Arg Leu Lys Tyr Tyr Ala Gly 
• 115 120 125 



Ser Tyr lie Asp Pro Ser Leu Arg Tyr Asn Lye Thr Asp Pro Phe Leu 
130 135 140 



Asn Leu Glu Gin Asn lie Ser Gin Gly Arg He Thr Asp Asp Thr Val 
145 150 155 160 



Lys Asn Ala Leu Gin His Tyr Leu Thr Glu Tyr Glu Val Leu Ala Asp 
165 170 175 



Leu Glu Tyr He Ser Val Asn Lys Gly Ala Asp Glu Ser Val Leu Phe 
180 185 190 



Phe Val Gly Arg Thr Lys Thr Met Pro Tyr Glu Tyr Tyr Trp Arg Arg 
195 200 205 



Leu Thr Leu Lys Lys Asp Asn Asn Asn Lys Leu Val Pro Ala He Trp 
210 215 220 



Ser Gin Trp Lys Lys He Thr Ala Asn He Gly Glu Ala Val Asn Asn 
225 230 235 240 



Tyr Val Val Leu His Trp His Asn Asn Arg Leu His Val Gin Trp Gly 
245 250 255 



Ser Thr Glu Lys Thr Gin Asn Asp Asp Gly Glu Pro He Glu Lys Arg 
260 265 270 



Tyr Leu Asn Asp Trp Phe Met Asp Lys Ser Ser Val Trp Ser Ser Phe 
275 280 285 



Arg Lys Val Ser Tyr He Glu Asn Ser Phe Thr Tyr Thr Glu Gly He 
290 295 300 



He Asp Ser Arg Asn He Thr He Ala Gly Asn Gin Leu Phe Cys Asp 
305 310 315 320 



Asp Ser Asn Thr Phe Lys Ala Thr He Thr Ala Leu Pro Phe Asp Gin 
325 330 335 



He Arg Val Tyr Leu Glu Lys He Tyr Gly Thr Gly Gly Ser He Thr 
340 345 350 
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Val Thr Gly Glu Asn Lys Gly Tyr He He Lys Val Gly Glu Pro Arg 
355 360 365 



Glu Val Ser Phe Ser Pro Asn Thr Leu Leu Asp Val Phe He Gly Ser 
370 375 380 

Asn Ala Ser Pro Arg Asp Pro Tyr Phe Lys Ala Thr Phe Asn Arg Glu 

385 390 395 400 



Ala Leu Gin Asn Ser Tyr Gly Ser He Lys He Asn Gin Tyr Thr Pro 
405 410 415 



Pro Ser Gly Ser Asn He Lys Gly Pro He Asp Leu Thr Leu Lys Asn 
420 425 430 



Asn He Asp Leu Ser Ala Leu Leu Glu Glu Ser Leu Asp Val Leu Phe 
435 440 445 



Asp Tyr Thr He Gin Gly Asn Asn Gin Leu Gly Gly Leu Glu Ala Phe 
450 455 460 



Asn Gly Pro Tyr Gly Leu Tyr Leu Trp Glu He Phe Leu His Val Pro 
465 470 475 480 



Phe Leu Met Ala Val Arg Phe His Thr Glu Gin Arg Tyr Glu Leu Ala 
485 490 495 



Glu Arg Trp Phe Lys Phe lie Phe Asn Ser Ala Gly Tyr Arg Asp Gly 
500 505 510 



Tyr Gly Asn Leu Leu Thr Asp Asp Lys Gly Asn Val Arg Tyr Trp Asn 
515 520 525 



Val Val Pro Leu Gin Glu Asp Thr Glu Trp Asp Asp Thr Leu Ser Leu 
530 535 540 

Ala Thr Thr Asp Pro Asp Glu He Ala Met Ala Asp Pro Met Gin Tyr 

545 ~ 550 555 560 



Lys Leu Ala He Phe He His Thr Leu Asp Phe Leu He Ser Arg Gly 
565 570 575 



Asp Ser Leu Tyr Arg Met Leu Glu Arg Asp Thr Leu Thr Glu Ala Lys 
580 585 590 
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Met Tyr Tyr lie Gin Ala Ser Gin Leu Leu Gly Pro Arg Pro Glu He 
595 600 605 



Arg lie Asn His Ser Trp Pro Asp Pro Thr Leu Gin Ser Glu Ala Asp 
610 615 620 



Ala Val Thr Ala Val Pro Thr Arg Ser Asp Ser Pro Ala Ala Pro He 
625 630 635 640 



Leu Ala Leu Arg Ala Leu Leu Asn Ala Glu Asn Gly His Phe Leu Pro 
645 650 655 



Pro Tyr Asn Asp Glu Leu Leu Ala Phe Trp Asp Lys He Asp Leu Arg 
660 665 670 



Leu Tyr Asn Leu Arg His Asn Leu Ser Leu Asp Gly Gin Pro Leu His 
675 680 685 



Leu Pro Leu Phe Thr Glu Pro Val Asn Pro Arg Glu Leu Gin Val Gin 
690 695 700 



His Gly Ala Gly Asp Gly Leu Gly Gly Ser Ala Gly Ser Val Gin Ser 
705 710 715 " 720 



Arg Gin Ser Val Tyr Arg Phe Pro Leu Val He Asp Lys Ala Arg Asn 
725 730 ' 735 



Ala Ala Ser Ser Val He Gin Phe Gly Asn Ala Leu Glu Asn Ala Leu 
740 745 750 



Thr Lys Gin Asp Ser Glu Ala Met Thr Met Leu Leu Gin Ser Gin Gin 
755 760 765 



Gin He Val Leu Gin Gin Thr Arg Asp He Gin Glu Lys Asn Leu Ala 
770 775 780 



Ser Leu Gin Ala Ser Leu Glu Ala Thr Met Thr Ala Lys Ala Gly Ala 
7B5 790 795 800 



Lys Ser Arg Lys Thr His Phe Ala Gly Leu Ala Asp Asn Trp Met Ser 
805 810 815 



His Asn Glu Thr Ala Ser Leu Ala Leu Arg Thr Thr Ala Gly He He 
820 825 830 
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Asn Thr Ser Ser Thr Val Pro lie Ala lie Thr Gly Gly Leu Asp Met 
835 840 845 

Ala Pro Asn He Phe Gly Phe Ala Val Gly Gly Ser Arg Trp Gly Ala 
850 855 860 

Ala Ser Ala Ala Val Ala Gin Gly Leu Gin He Ala Ala Gly Val Met 
865 870 875 880 

Glu Gin Thr Ala Asn He He Asp He Ser Glu Ser Tyr Arg Arg Arg 
885 890 895 

Arg Glu Asp Trp Leu Leu Gin Arg Asp Val Ala Glu Asn Glu Ala Ala 
900 905 910 

Gin Leu Asp Ser Gin He Ala Ala Leu Arg Glu Gin Met Asp Met Ala 
915 920 925 

Arg Lys Gin Leu Ala Leu Ala Glu Thr Glu Gin Ala His Ala Gin Ala 
930 935 940 

Val Tyr Glu Leu Leu Ser Thr Arg Phe Thr Asn Gin Ala Leu Tyr Asn 
945 950 955 960 

Trp Met Ala Gly Arg Leu Ser Ser Leu Tyr Tyr Gin Met Tyr Asp Ala 
965 970 975 

Ala Leu Pro Leu Cys Leu Met Ala Lys Gin Ala Leu Glu Lys Glu He 
980 985 990 

Gly Asn Asp Lys Thr Val Gly He Phe Thr Leu Pro Ala Trp Asn Asp 
995 1000 1005 

Leu Tyr Gin Gly Leu Leu Ala Gly Glu Ala Leu Leu Leu Glu Leu 
1010 1015 1020 

Gin Lys Leu Glu Asn Leu Trp Leu Glu Glu Asp Lys Arg Gly Met 
1025 1030 1035 

Glu Ala. Val Arg Thr Val Ser Leu ABp Thr Leu Leu Arg Lys Glu 
1040 1045 1050 

Lys Pro Glu Ser Gly Phe Ala Asp Phe Val Lys Glu Val Leu Asp 
1055 1060 1065 

Gly Lys Thr Pro Asp Pro Val Ser Gly Val Ser Val Gin Leu Gin 
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1070 1075 1080 



Asn Asn He Phe Ser Ala Thr Leu Asp Leu Ser Thr Leu Gly Leu 
1085 1090 1095 



Asp Arg Phe Tyr Asn Gin Ala Glu Lys Ala His Arg He Lys Asn 
1100 1105 1110 



Leu Ser Val Thr Leu Pro Ala Leu Leu Gly Pro Tyr Gin Asp He 
1115 1120 1125 



Ala Ala Thr Leu Ser Leu Gly Gly Glu Thr Val Ala Leu Ser His 
1130 1135 " 1140 



Gly Val Asp Asp Ser Gly Leu Phe lie Thr Asp Leu Asn Asp Ser 
1145 1150 1155 



Arg Phe Leu Pro Phe Glu Gly Met Asp Pro Leu Ser Gly Thr Leu 
1160 1165 1170 



Val Leu Ser He Leu His Ala Gly Gin Asp Gly Asp Gin Arg Leu 
1175 1180 ~ 1185 



Leu Leu Glu Ser Leu Asn Asp Val He Phe His He Arg Tyr Val 
1190 1195 1200 



Met Lys 
1205 



<210> 6 
<211> 3300 
<212> DNA 

<213> Paenibacillus strain IDAS 1529 
<400> 6 

atggtgtcaa caacagacaa cacggccggc gtattccggc tcggaaccga agaattaaca 60 
gaagcgctta agcagtccgg ttatcggacc gtctttgata ttgtatctga caatcttgcg 120 
gaatttcaga aaaacaatcc ggagattccc tcttctgacg cgaaggagat tcatcaatta 180 
gccgtccaga ggacagaaaa cttatgcatg ctttataagg cctggcagct gcacaatgat 240 
ccggttgtcc agagccttcc caaattatcc gcggataccg gcctgcaagg catgcgtgcc 300 
gcgttggagc ggagtcttgg aggcggagcc gattttggag acttgttccc ggagcgatcg 360 
ccagagggct atgcggaagc ctcctctata cagtcgcttt tctcgccggg acgttacctg 420 
acggtgctgt ataaaattgc gcgggatctc cacgacccaa aagataaact gcatattgac 480 
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aaccgccgtc 


cagatttgaa gtcgctgatc 


ctcaataatg 


acaatatgaa ccgagaggta 


540 


tcttctctgg 


atatccttct ggatgtgctg 


cagcccgaag 


gctctgacac gctgacatcc 


600 


ttgaaggata 


cctaccatcc gatgaccctt 


ccctatgatg 


acgaccttgc gcaaatcaat 


660 


gccgtggcgg 


aggcgcgttc atctaatttg 


ctggggattt 


gggataccct gctggacacg 


720 


cagcggactt 


ccatcctgca gaattccgcc 


gctgcccgcc 


ggataagcaa ggcgcggcac 


780 


tcggcatacg 


ccaatcagaa agcctccaat 


gatgagccgg 


tattcatcac gggagaggaa 


840 


atctacctgg 


aaaccggagg taaacggctt 


tttctggcgc 


ataaactcga gataggttca 


900 


actattagcg 


ctaaaatcaa cattggaccg 


ccgcaagcgg 


ccgatatcgc gccggcaaag 


960 


ttgcaactcg 


tatattacgg cagaggcggc 


agagggaact 


acttcctgcg cgtggcagac 


1020 


gatgtgtccc 


tcggtggaaa gctgctgacc 


aattgttatc 


tgaccagcga tgacggacag 


1080 


agcaacaata 


ttagcgggcc atactgccta 


atgatcaacc 


gaggcaccgg cagcatgcct 


114 0 


agcgggactc 


accttccagt tcagattgaa 


agagtgaccg 


atacatccat ccgcattttt 


1200 


gtgccggatc 


acggctattt ggggctaggc 


gaaagccttg 


ccagcaactg gaatgaaccg 


1260 


ttggcgctga 


atctgggctt ggatgaagcg 


ttgaccttta 


ccttgagaaa gaaggagacg 


1320 


ggaaatgaca 


ccatttccat aatcgacatg 


ctgccgccgg 


tagcgaacac gactccgtct 


1380 


ccgccgacga 


gggaaacgct ttccttgacg 


ccaaacagct 


tccgtctgct ggtcaaccct 


1440 


gagccgacag 


cggaggacat cgccaagcac 


tacaacgtca 


cgacggtaac ccgggctcct 


1500 


gccgatctgg 


cctccgcctt aaatgttgtc 


gatgatttct 


gcttgaaaac cggtttgagc 


1560 


tbtaacgaat 


tgctggattt aaccatgcag 


aaggattatc 


agtcaaaaag cagtgagtac 


1620 


aaaagccgat 


ttgtaaaatt cggcggcggg 


gagaatgttc 


cggtatcaag ctatggcgca 


1680 


gcctttctga 


caggagcgga agatactcct 


ttgtgggtga 


aacagtataa cagcgtgggg 


1740 


actgcaacaa 


gcacccctgt tttaaacttt 


acgccagata 


atgttgtggc tttggcagga 


1800 


agggcggaaa 


agcttgtccg gctgatgcgc 


agcacgggtc 


tttcctttga gcagttggat 


1860 


tggctgattg 


ccaatgccag ccgtgccgtt 


atcgaacacg 


gtggagagct ttttctggat 


1920 


aagccggtac 


tggaagctgt ggccgaattc 


acaaggctca 


ataagcgtta tggcgtcaca 


1980 


tcggatatgt 


tcgccgcgtt tatcggcgaa 


gtcaatacgt 


atacagaagc gggcaaggac 


2040 


agcttttatc 


aggcgagttt cagcacggcc 


gaccattcgg 


ctaccttacc tttgggcgct 


2100 


tctttgcaac 


ttgaggtgag caagcaggat 


cgatatgaag 


cgatttgctg cggggctatg 


2160 


ggggtgaccg 


ccgatgagtt ctcccgtatc 


ggcaaatact 


gctttgggga taaagcacag 


2220 


caaatcacgg 


ccaatgaaac aaccgttgcc 


cagctttatc 


gtttaggccg aattcctcat 


2280 


atgctaggct 


tgcgttttac cgaggcagag 


ctgttgtgga 


aattgatggc tgggggcgag 


2340 
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gataccttgc tccgcacgat tggcgcgaac cctcgcagtt tagaagcgtt agagattatt 2400 

cgccggacgg aggtcctttt ggactggatg gatgcccatc agctggatgt tgtctccctg 2460 

caagccatgg ttaccaatcg gtacagcggc acagccacgc cggagctgta caattttttg 2520 

gcacaggtgc atcaatccgc aagcagtgcc gcgaacgtgg ccagagcgga tggtcaggat 2580 

acgttgcctg cggacaagct gctccgggca ttggcggcgg gcttcaaact gaaagccaac 2640 

gtgatggcgc gagtaatcga ctggatggac aaaaccaata aagcgtttac gctgcgggct 2700 

ttctgggaca agcttcaagc gtatttcagc gccgatcatg aagaagaact gaccgccctg 2760 

gaaggagaag ccgcaatgct gcagtggtgc cagcagatca gccagtatgc gctcattgtc 282 0 

cgctggtgcg ggttaagcga gcaggatctg gcgctgctga ccgggaatcc ggagcagctt 2880 

ctggacggac aacatacggt gcccgtaccc tcgctgcatc tcctgctggt gctgacccgc 294 0 

ctgaaggaat ggcagcagcg cgtccaggtt tccagcgagg aggctatgcg ctattttgcc . 3000 

caggccgatt cgccaaccgt cacgcgcgac gatgcggtta atctgcttgc ccgtatccat 3060 

ggctggaatg aagcggatac cgtctcgatg aatgactacc tgctgggaga gaacgaatat 3120 

cctaagaact ttgatcagat ctttgcactg gaaagctggg tcaacctggg ccgtcaactg 3180 

aacgtgggca gcagaacgct gggagagctg gttgacatgg ctgaagagga taaaaccgcg 324 0 

gaaaacatgg atctgattac ttcggtggcc catagcctga tggctgcagc gaaagcctga 3300 

<210> 7 
<211> 1099 
<212> PRT 

<213> Paenibacillus strain IDAS 1529 
<40O> 7 

Met Val Ser Thr Thr Asp Asn Thr Ala Gly Val Phe Arg Leu Gly Thr 
15 10 ~ 15 

Glu Glu Leu Thr Glu Ala Leu Lys Gin Ser Gly Tyr Arg Thr Val Phe 
20 25 30 

Asp lie Val Ser Asp Asn Leu Ala Glu Phe Gin Lys Asn Asn Pro Glu 
35 40 45 

He Pro Ser Ser Asp Ala Lys Glu He His Gin Leu Ala Val Gin Arg 
50 55 60 

Thr Glu Asn Leu Cys Met Leu Tyr Lys Ala Trp Gin Leu His Asn Asp 
65 70 75 80 
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Pro Val Val Gin Ser Leu Pro Lys Leu Ser Ala Asp Thr Gly Leu Gin 
85 90 95 



Gly Met Arg Ala Ala Leu Glu Arg Ser Leu Gly Gly Gly Ala Asp Phe 
100 105 HO 



Gly Asp Leu Phe Pro Glu Arg Ser Pro Glu Gly Tyr Ala Glu Ala Ser 
115 120 125 



Ser He Gin Ser Leu Phe ser Pro Gly Arg Tyr Lgu Thr Val Leu Tyr- 
130 135 140 

Lys He Ala Arg Asp Leu His Asp Pro Lys Aisp Lys Leu His He Asp 
145 150 155 160 



Asn Arg Arg Pro Asp Leu Lys Ser Leu He Leu Asn Asn Asp Asn Met 
165 170 175 



Asn Arg Glu Val Ser Ser Leu Asp He Leu Leu Asp Val Leu Gin Pro 
180 185 190 



Glu Gly Ser Asp Thr Leu Thr Ser Leu Lys Asp Thr Tyr His Pro Met 
195 200 205 

Thr Leu Pro Tyr Asp Asp Asp Leu Ala Gin He Asn Ala Val Ala Glu 
210 215 220 

i* 

Ala Arg Ser Ser Asn Leu Leu Gly He Trp Asp Thr Leu Leu Asp Thr 
225 230 235 240 

Gin Arg Thr Ser He Leu Gin Asn Ser Ala Ala Ala Arg Arg He Ser 
245 250 255 



Lys Ala Arg His Ser Ala Tyr Ala Asn Gin Lys Ala Ser Asn Asp Glu 
260 265 270 



Pro Val Phe He Thr Gly Glu Glu He Tyr Leu Glu Thr Gly Gly Lys 
275 280 285 



Arg Leu Phe Leu Ala His Lys Leu Glu He Gly Ser Thr He Ser Ala 
290 295 300 



Lys He Asn lie Gly Pro Pro Gin Ala Ala Asp He Ala Pro Ala Lys 
305 310 315 320 



Leu Gin Leu Val Tyr Tyr Gly Arg Gly Gly Arg Gly Asn Tyr Phe Leu 
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37 
330 



335 



Arg Val Ala Asp Asp Val Ser Leu Gly Gly Lys Leu Leu Thr Asn Cys 
340 345 350 



Tyr Leu Thr Ser Asp Asp Gly Gin Ser Asn Asn lie Ser Gly Pro Tyr 
355 360 365 



Cys Leu Met lie Asn Arg Gly Thr Gly Ser Met Pro Ser Gly Thr His 
370 375 380 



Leu Pro Val Gin lie Glu Arg Val Thr Asp Thr Ser lie Arg lie Phe 
385 390 395 400 



Val Pro Asp His Gly Tyr Leu Gly Leu Gly Glu Ser Leu Ala Ser Asn 
405 410 415 



Trp Asn Glu Pro Leu Ala Leu Asn Leu Gly Leu Asp Glu Ala Leu Thr 
420 425 430 



Phe Thr Leu Arg Lys Lys Glu Thr Gly Asn Asp Thr He Ser He He 
435 440 445 



Asp Met Leu Pro Pro Val Ala Asn Thr Thr Pro Ser Pro Pro Thr Arg 
450 455 460 



Glu Thr Leu Ser Leu Thr Pro Asn Ser Phe Arg Leu Leu Val Asn Pro 
465 470 475 4B0 



Glu Pro Thr Ala Glu Asp He Ala Lys His Tyr Asn Val Thr Thr Val 
485 490 495 



Thr Arg Ala Pro Ala Asp Leu Ala Ser Ala Leu Asn Val Val Asp Asp 
500 505 510 



Phe Cys Leu Lys Thr Gly Leu Ser Phe Asn Glu Leu Leu Asp Leu Thr 
515 520 525 



Met Gin Lys Asp Tyr Gin Ser Lys Ser Ser Glu Tyr Lys Ser Arg Phe 
530 535 540 



Val Lys Phe Gly Gly Gly Glu Asn Val Pro Val Ser Ser Tyr Gly Ala 
545 550 555 560 



Ala Phe Leu Thr Gly Ala Glu Asp Thr Pro Leu Trp Val Lys Gin Tyr 
565 570 575 
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Asn Ser Val Gly Thr Ala Thr Ser Thr Pro Val Leu Asn Phe Thr Pro 
580 585 590 



Asp Asn Val Val Ala Leu Ala Gly Arg Ala Glu Lys Leu Val Arg Leu 
595 600 605 



Met Arg Ser Thr Gly Leu Ser Phe Glu Gin Leu Asp Trp Leu lie Ala 
610 615 620 



Asn Ala Ser Arg Ala Val lie Glu His Gly Gly Glu Leu Phe Leu Asp 
625 630 635 640 



Lys Pro Val Leu Glu Ala Val Ala Glu Phe Thr Arg Leu Asn Lys Arg 
645 650 655 



Tyr Gly Val Thr Ser Asp Met Phe Ala Ala Phe He Gly Glu Val Asn 
660 665 670 



Thr Tyr Thr Glu Ala Gly Lys Asp Ser Phe Tyr Gin Ala Ser Phe Ser 
675 680 685 



Thr Ala Asp His Ser Ala Thr Leu Pro Leu Gly Ala Ser Leu Gin Leu 
690 695 700 



Glu Val Ser Lys Gin Asp Arg Tyr Glu Ala He Cys Cys Gly Ala Met 
705 710 715 720 



Gly Val Thr Ala Asp Glu Phe Ser Arg He Gly Lys Tyr Cys Phe Gly 
725 730 735 



Asp Lys Ala Gin Gin He Thr Ala Asn Glu Thr Thr Val Ala Gin Leu 
740 745 750 



Tyr Arg Leu Gly Arg He Pro His Met Leu Gly Leu Arg Phe Thr Glu 
755 760 765 



Ala Glu Leu Leu Trp Lys Leu Met Ala Gly Gly Glu Asp Thr Leu Leu 
770 775 * 780 



Arg Thr He Gly Ala Asn Pro Arg Ser Leu Glu Ala Leu Glu lie lie 
785 790 795 800 



Arg Arg Thr Glu Val Leu Leu Asp Trp Met Asp Ala His Gin Leu Asp 
805 ~ 810 " 815 
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Val Val Ser Leu Gin Ala Met Val Thr Asn Arg Tyr Ser Gly Thr Ala 
820 825 830 



Thr Pro Glu Leu Tyr Asn Phe Leu Ala Gin Val His Gin Ser Ala Ser 
835 840 845 



Ser Ala Ala Asn Val Ala Arg Ala Asp Gly Gin Asp Thr Leu Pro Ala 
850 855 860 



Asp Lys Leu Leu Arg Ala Leu Ala Ala Gly Phe Lys Leu Lys Ala Asn 
865 870 875 880 

Val Met Ala Arg Val He Asp Trp Met Asp Lys Thr Asn Lys Ala Phe 
885 890 895 

Thr Leu Arg Ala Phe Trp Asp Lys Leu Gin Ala Tyr Phe Ser Ala Asp 
900 905 " 910 



His Glu Glu Glu Leu Thr Ala Leu Glu Gly Glu Ala Ala Met Leu Gin 
915 920 925 



Trp Cys Gin Gin He Ser Gin Tyr Ala Leu He Val Arg Trp Cys Gly 
930 935 940 



Leu Ser Glu Gin Asp Leu Ala Leu Leu Thr Gly Asn Pro Glu Gin Leu 
945 950 955 9S Q 



Leu Asp Gly Gin His Thr Val Pro Val Pro Ser Leu His Leu Leu Leu 
965 970 975 



Val Leu Thr Arg Leu Lys Glu Trp Gin Gin Arg Val Gin Val Ser Ser 
980 985 990 



Glu Glu Ala Met Arg Tyr Phe Ala Gin Ala Asp Ser Pro Thr Val Thr 
995 1000 1005 



Arg Asp Asp Ala Val Asn Leu Leu Ala Arg He His Gly Trp Asn 
1010 1015 1020 



Glu Ala Asp Thr Val Ser Met Asn ABp Tyr Leu Leu Gly Glu Asn 
1025 1030 1035 



Glu Tyr Pro Lys Asn Phe Asp Gin He Phe Ala Leu 
1040 1045 1050 



Glu Ser Trp 
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Val Asn Leu Gly Arg Gin Leu Asn Val Gly Ser Arg Thr Leu Gly 
1055 1060 1065 

Glu Leu Val Asp Met Ala Glu Glu Asp Lys Thr Ala Glu Asn Met 
1070 " 1075 1080 

Asp Leu lie Thr Ser Val Ala His Ser Leu Met Ala Ala Ala Lys 
1085 1090 1095 

Ala 



<210> 8 
<211> 3627 
<212> DNA 

<213> Paenibacillus strain IDAS 1529 
<40O> 8 

atgaccaagg aaggtgataa gcatatgtct acttcaaccc tgttgcaatc gattaaagaa 60 

gcccgccggg atgcgctggt caaccattat attgctaatc aggttccgac agcgcttgcg 120 

gacaagatta cggacgcgga cagcctgtat gagtacttgc tgctggatac caagatcagt 180 

gaactcgtaa aaacatcgcc gatagcggag gccatcagca gcgtgcagtt atacatgaac 240 

cgctgcgtcg aaggctatga aggcaagttg actccggaaa gtaatactca ttttggccca 300 

ggtaaatttc tatataactg ggatacgtac aacaaacgtt tttccacctg ggcaggaaaa 360 

gaacgcttga aatattatgc aggcagctat attgagccgt ccttgcgcta caacaaaacc 420 

gatccattcc tgaacctgga acagagcatc agccagggaa gaattactga tgataccgta 480 

aagaacgcgc tgcaacacta cctgactgaa tatgaagtgt tggcggatct ggattatatc 540 

agcgttaata aaggcggcga cgaaagtgtt ttactctttg ttggacgcac caaaaccgta 600 

ccgtatgaat actactggcg ccgtttgctt ttaaaaaggg acaataataa taagctagta 660 

ccagcagtct ggtctcagtg gaaaaaaatc agtgccaata tcggtgaagc ggttgatagt 720 

tatgtggtgc ctcggtggca taaaaaccgg ctacatgtgc aatggtgttc tatagagaaa 780 

agtgaaaatg atgccggtga acccattgag aaacgatatt tgaatgactg gttcatggat 840 

agttccggag tctggtcttc atttcgaaag attccggttg tggaaaagag tttcgaatat 900 

ttggacggaa gcctcgatcc ccgatttgtc gctcttgtta gaaatcaaat attaattgat 960 

gagccagaaa tattcagaat tacagtatca gcccctaatc cgatagatgc aaatggaaga 1020 

gtagaggtac attttgaaga aaactatgca aacagatata atattaccat taaatatggg 1080 

acaacgagtc ttgctattcc tgcagggcag gtagggcatc caaatatctc tattaatgaa 1140 

acattaaggg ttgaattcgg caccaggccg gattggtatt atactttcag atatttagga 1200 
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aatacaatcc aaaactcata cggttcaatt gtcaataatc aattttcacc tccatcagga 1260 
agcaatatta aaggtcctat cgaccttacc ctgaaaaata acatcgacct gtcggccttg 1320 
ttggatgaga gccttgacgc actgttcgac tataccattc agggcgataa ccaattgggc 1380 
ggcttagctg cctttaacgg gccttacgga ctttacttgt gggaaatctt cttccatgtt 1440 
ccttttttaa tggcggttcg cttccacacc gagcagcggt atgagttggc ggaacgttgg 1500 
tttaaattca tcttcaacag cgcaggatac cgtgatgatt acggcagtct gctgacggat 1560 
gacaaaggca acgtgcgtta ctggaacgtg ataccgctgc aagaggacac ggagtgggat 1620 
gacacgttgt ccctggcaac gaccgacccg gacgagattg cgatggccga cccgatgcaa 1680 
tacaagctgg ctatatttat tcacaccatg gacttcctga tcagccgcgg cgatagcttg 174 0 

taccggatgc tggagcggga taccctggcc gaagccaaga tgtattacat tcaggccagc 18 00 

caactgcttg ggccccgccc cgacatccgg ctcaatcaca gttggcctaa tccgaccttg 1860 

caaagcgaag cggacgcggt aaccgccgtg ccgacgcgaa gcgattcgcc ggcagcgcca 1920 

attttggcct tgcgagcgct tctgacaggc gaaaacggtc atttcctgcc gccttataat 1980 

gatgaactgt tcgctttctg ggacaaaatc gatctgcgtt tatacaattt gcgccacaat 2040 

ttgagtcfcgg acggtcagcc gcttcatttg ccgctctttg ccgaaccggt caatccgcgt 2100 

gaattgcagg ttcagcatgg cccgggcgat ggcttggggg gaagcgcggg ttccgcccaa 2160 

agccgtcaga gtgtctatcg ttttcctctg gtcatcgata aggcgcgcaa tgcggccaac 2220 

agtgtcatcc aattcggcaa tgccctggaa aacgcactga ccaagcaaga cagcgaagca 2280 

atgaccatgc tgttgcagtc ccagcagcag attgtcctgc agcaaacccg cgatattcag 2340 

gagaagaacc tggccgcgct gcaagcaagt ctggaagcaa cgatgacagc gaaagcgggg 24 00 

gcggagtccc ggaagaccca ttttgccggc ttggcggaca actggatgtc ggacaatgaa 2460 

accgcctcac tcgcactgcg taccaccgcg ggaatcatca ataccagctc aaccgtgccg 2520 

atcgccatca ccggcggctt ggatatggct ccgaacattt ttggtttcgc agttggaggt 2580 

tcccgctggg gagcagccag cgcggctgta gcccaaggat tgcaaatcgc cgccggcgta 2640 

atggaacaga cggccaatat tatcgatatt agcgaaagct accgccggcg ccgggaggat 2700 

tggctgctgc agcgggatgt tgccgaaaat gaagcggcgc agttggattc gcagattgcg 2760 

gccctgcggg aacagatgga tatggcgcgc aagcaacttg cgctggcgga gacggaacag 2820 

gcgcacgcgc aagcggtcta cgagctgcaa agcacccgct ttacgaatca agctttgtat 2880 

aactggatgg ctggacgtct gtcgtctcta tactatcaaa tgtatgacgc cgcattgccg 294 0 

ctctgcttga tggcgaagca ggctttagag aaagaaatcg gttcggataa aacggtcgga 3000 
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gtcttgtccc 


tcccggcctg 


gaatgatcta 


tatcagggat tattggcggg 


cgaggcgctg 


3060 


ctgctcgagc 


ttcagaagct 


ggagaatctg 


tggctggagg aagacaagcg 


cggaatggaa 


3120 


gccgtaaaaa 


cagtctctct 


ggatactctt 


ctccgcaaaa caaatccgaa 


ctccgggttt 


3180 


gcggatctcg 


tcaaggaggc 


actggacgaa 


aacggaaaga cgcctgaccc ggtgagcgga 


3240 


gtcggcgtac 


agctgcaaaa 


caatattttc 


agcgcaaccc ttgacctctc 


cgttcttggc 


3300 




/-i +- a raa^ra 

C U Ud^CtCtU^A 




tcccgcagga tcaaaaatat 


gtcggttacc 


3360 


ttacctgcgc 


tattyyyyCC 




atagaggcaa ccttafccgct 


aggcggcgag 


3420 


accgttgcgc 


tgtcccatgg 


cgtggatgac 


agcggcttgt tcatcactga 


tctcaacgac 


3480 


agccggttcc 


tgcctttcga 


gggcatggat 


ccgttatccg gcacactcgt 


cctgtcgata 


3540 


ttccatgccg 


ggcaagacgg 


cgaccagcgc 


ctcctgctgg aaagtctcaa 


tgacgtcatc 


3600 


ttccacattc 


gatatgttat 


gaaatag 






3627 



<210> 9 
<211> 1208 
<212> PRT 

<213> Paenibacillus strain IDAS 1529 
<400> 9 

Met Thr Lys Glu Gly Asp Lys His Met Ser Thr Ser Thr Leu Leu Gin 
15 10 15 



Ser lie Lys Glu Ala Arg Arg Asp Ala Leu Val Asn His Tyr lie Ala 
20 25 30 



Asn Gin Val Pro Thr Ala Leu Ala Asp Lys He Thr Asp Ala Asp Ser 
35 40 45 



Leu Tyr Glu Tyr Leu Leu Leu Asp Thr Lys He Ser Glu Leu Val Lys 
50 55 60 



Thr Ser Pro He Ala Glu Ala He Ser Ser Val Gin Leu Tyr Met Asn 
65 70 75 80 



Arg Cys Val Glu Gly Tyr Glu Gly Lys Leu Thr Pro Glu Ser Asn Thr 
85 90 95 



His Phe Gly Pro Gly Lys Phe Leu Tyr Asn Trp Asp Thr Tyr Asn Lys 
100 105 110 



Arg Phe Ser 
115 



Thr Trp Ala Gly Lys Glu 
120 



Arg Leu Lys Tyr Tyr Ala Gly 
125 
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Ser Tyr He Glu Pro Ser Leu Arg Tyr Asn Lys Thr Asp Pro Phe Leu 
130 135 140 



Asn Leu Glu Gin Ser lie Ser Gin Gly Arg He Thr Asp Asp Thr Val 
145 150 155 * 160 



Lys Asn Ala Leu Gin His Tyr Leu Thr Glu Tyr Glu Val Leu Ala Asp 
165 170 175 



Leu Asp Tyr He Ser Val Asn Lys Gly Gly Asp Glu Ser Val Leu Leu 
180 185 iso 



Phe Val Gly Arg Thr Lys Thr Val Pro Tyr Glu Tyr Tyr Trp Arg Arg 
195 200 205 



Leu Leu Leu Lys Arg Asp Asn Asn Asn Lys Leu Val Pro Ala Val Trp 
210 215 220 



Ser Gin Trp Lys Lys He Ser Ala Asn He Gly Glu Ala Val Asp Ser 
225 230 235 240 



Tyr Val Val Pro Arg Trp His Lys Asn Arg Leu His Val Gin Trp Cys 
245 250 255 



Ser He Glu Lys Ser Glu Asn Asp Ala Gly Glu Pro He Glu Lys Arg 
260 265 270 



Tyr Leu Asn Asp Trp Phe Met Asp Ser Ser Gly Val Trp Ser Ser Phe 
275 280 285 



Arg Lys He Pro Val Val Glu Lys Ser Phe Glu Tyr Leu Asp Gly Ser 
290 295 3 00 



Leu Asp Pro Arg Phe Val Ala Leu Val Arg Asn Gin He Leu He Asp 
305 310 315 320 



Glu Pro Glu He Phe Arg He Thr Val Ser Ala Pro Asn Pro He Asp 
325 330 335 



Ala Asn Gly Arg Val Glu Val His Phe Glu Glu Asn Tyr Ala Asn Arg 
340 345 350 



Tyr Asn He Thr He Lys Tyr Gly Thr Thr Ser Leu Ala He Pro Ala 
355 360 365 
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Gly Gin Val Gly His Pro Asn He Ser He Asn Glu Thr Leu Arg Val 
370 375 380 



Glu Phe Gly Thr Arg Pro Asp Trp Tyr Tyr Thr Phe Arg Tyr Leu Gly 
385 390 395 400 



Asn Thr He Gin Asn Ser Tyr Gly Ser He Val Asn Asn Gin Phe Ser 
405 410 415 



Pro Pro Ser Gly Ser Asn He Lys Gly Pro He Asp Leu Thr Leu Lys 
420 425 430 



Asn Asn He Asp Leu Ser Ala Leu Leu Asp Glu Ser Leu Asp Ala Leu 
435 440 445 



Phe Asp Tyr Thr He Gin Gly Asp Asn Gin Leu Gly Gly Leu Ala Ala 
450 455 460 



Phe Asn Gly Pro Tyr Gly Leu Tyr Leu Trp Glu He Phe Phe His Val 
465 • ~ 470 475 480 



Pro Phe Leu Met Ala Val Arg Phe His Thr Glu Gin Arg Tyr Glu Leu 
485 490 495 



Ala Glu Arg Trp Phe Lys Phe He Phe Asn Ser Ala Gly Tyr Arg Asp 
500 505 510 



Asp Tyr Gly Ser Leu Leu Thr Asp Asp Lys Gly Asn Val Arg Tyr Trp 
515 520 525 



Asn Val He Pro Leu Gin Glu Asp Thr Glu Trp Asp Asp Thr Leu Ser 
530 535 540 



Leu Ala Thr Thr Asp Pro Asp Glu He Ala Met Ala Asp Pro Met Gin 
545 550 555 560 



Tyr Lys Leu Ala He Phe He His Thr Met Asp Phe Leu He Ser Arg 
565 570 575 



Gly Asp Ser Leu Tyr Arg Met Leu Glu Arg Asp Thr Leu Ala Glu Ala 
580 585 590 



Lys Met Tyr Tyr He Gin Ala Ser Gin Leu Leu Gly Pro Arg Pro Asp 
595 600 605 



He Arg Leu Asn His Ser Trp Pro Asn Pro Thr Leu Gin Ser Glu Ala 
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610 615 620 



Asp Ala Val Thr Ala Val Pro Thr Arg Ser Asp Ser Pro Ala Ala Pro 
625 630 635 640 



lie Leu Ala Leu Arg Ala Leu Leu Thr Gly Glu Asn Gly His Phe Leu 
645 650 ' 655 



Pro Pro Tyr Asn Asp Glu Leu Phe Ala Phe Trp Asp Lys He Asp Leu 
660 665 ~ 670 



Arg Leu Tyr Asn Leu Arg His Asn Leu Ser Leu Asp Gly Gin Pro Leu 
675 680 685 



His Leu Pro Leu Phe Ala Glu Pro Val Asn Pro Arg Glu Leu Gin Val 
690 695 700 



Gin His Gly Pro Gly Asp Gly Leu Gly Gly Ser Ala Gly Ser Ala Gin 
705 710 715 720 



Ser Arg Gin Ser Val Tyr Arg Phe Pro Leu Val He Asp Lys Ala Arg 
725 730 735 



Asn Ala Ala Asn Ser Val He Gin Phe Gly Asn Ala Leu Glu Asn Ala 
740 745 750 



Leu Thr Lys Gin Asp Ser Glu Ala Met Thr Met Leu Leu Gin Ser Gin 
755 760 765 



Gin Gin He Val Leu Gin Gin Thr Arg Asp He Gin Glu Lys Asn Leu 
770 775 780 



Ala Ala Leu Gin Ala Ser Leu Glu Ala Thr Met Thr Ala Lys Ala Gly 
785 790 795 ' 800 



Ala Glu Ser Arg Lys Thr His Phe Ala Gly Leu Ala Asp Asn Trp Met 
805 810 815 



Ser Asp Asn Glu Thr Ala Ser Leu Ala Leu Arg Thr Thr Ala Gly He 
820 825 830 



He Asn Thr Ser Ser Thr Val Pro He Ala He Thr Gly Gly Leu Asp 
835 840 845 



Met Ala Pro Asn He Phe Gly Phe Ala Val Gly Gly Ser Arg Trp Gly 
850 855 860 
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Ala Ala Ser Ala Ala Val Ala Gin Gly Leu Gin He Ala Ala Gly Val 
865 870 875 880 



Met Glu Gin Thr Ala Asn He He Asp He Ser Glu Ser Tyr Arg Arg 
885 890 895 



Arg Arg Glu Asp Trp Leu Leu Gin Arg Asp Val Ala Glu Asn Glu Ala 
900 905 910 



Ala Gin Leu Asp Ser Gin He Ala Ala Leu Arg Glu Gin Met Asp Met 
915 920 925 



Ala Arg Lys Gin Leu Ala Leu Ala Glu Thr Glu Gin Ala His Ala Gin 
930 935 940 



Ala Val Tyr Glu Leu Gin Ser Thr Arg Phe Thr Asn Gin Ala Leu Tyr 
945 * 950 955 960 



Asn Trp Met Ala Gly Arg Leu Ser Ser Leu Tyr Tyr Gin Met Tyr Asp 
965 970 975 



Ala Ala Leu Pro Leu Cys Leu Met Ala Lys Gin Ala Leu Glu Lys Glu 
980 985 990 



He Gly Ser Asp Lys Thr Val Gly Val Leu Ser Leu Pro Ala Trp Asn 
995 1000 1005 



Asp Leu Tyr Gin Gly Leu Leu Ala Gly Glu Ala Leu Leu Leu Glu 
1010 1015 1020 



Leu Gin Lys Leu Glu Asn Leu Trp Leu Glu Glu Asp Lys Arg Gly 
1025 1030 1035 



Met Glu Ala Val Lys Thr Val Ser Leu Asp Thr Leu Leu Arg Lys 
1040 1045 1050 



Thr Asn Pro Asn Ser Gly Phe Ala Asp Leu Val Lys Glu Ala Leu 
1055 1060 1065 



Asp Glu Asn Gly Lys Thr Pro Asp Pro Val Ser Gly Val Gly Val 
1070 1075 1080 



Gin Leu Gin Asn Asn He Phe Ser Ala Thr Leu Asp Leu Ser Val 
1085 1090 1095 
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Leu Gly Leu Asp Arg Ser Tyr Asn Gin Ala Glu Lys Ser Arg Arg 
1100 1105 mo 



He Lys Asn Met Ser Val Thr Leu Pro Ala Leu Leu Gly Pro Tyr 
1115 1120 1125 



Gin Asp lie Glu Ala Thr Leu Ser Leu Gly Gly Glu Thr Val Ala 
1130 1135 H40 



Leu Ser His Gly Val Asp Asp Ser Gly Leu Phe He Thr Asp Leu 
1145 1150 H55 



Asn Asp Ser Arg Phe Leu Pro Phe Glu Gly Met Asp Pro Leu Ser 
1160 1165 H70 



Gly Thr Leu Val Leu Ser He Phe His Ala Gly Gin Asp Gly Asp 
1175 H80 H85 



Gin Arg Leu Leu Leu Glu Ser Leu Asn Asp Val He Phe His He 
1190 H95 1200 



Arg Tyr Val Met Lys 
1205 



<210> 10 
<211> 4335 
<212> DNA 

<213> Paenibacillus strain IDAS 1529 
<400> 10 

atgccacaat ctagcaatgc cgatatcaag ctattgtcgc catcgctgcc aaagggcggc 60 

ggttccatga agggaatcga agaaaacatc gcggctcccg gctccgacgg catggcacgt 120 

tgtaatgtgc cgctgccggt aacctccggc cgctatatta ctcctgatat aagcctgtcc 180 

tatgcgagcg gccacggcaa cggcgcttat ggaatgggct ggacgatggg agtgatgagc 240 

attagccgga gaacaagccg agggaccccc agttatacat ccgaagacca gttccttggt 300 

ccggatgggg aggtgcttgt tccggaaagc aacgaacaag gggagatcat tacccgccac 360 

accgatacgg cccaagggat accgttaggc gagacgttta cggttacacg ctattttccc 420 

cggatcgaga gcgcttttca tttgctggaa tactgggaag cgcaagcagg aagcgcaaca 480 

gcgtcgtttt ggcttattca ctctgccgat ggagtgctgc actgtctggg taaaactgct 540 

caggcgagga tagccgcccc tgacgattcc gccaagatcg cagaatggct agtggaggag 600 

tccgtctccc ccttcggaga gcatatttat taccaataca aagaagaaga caatcaaggc 660 
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gtgaatctgg aggaagacaa 


tcatcaatat ggggegaace 


gctatctgaa 


atcuattcQc 


720 


tatggaaata aggttgcctc 


tccttctctc tatgtctgga 


acracrcrcraaa t 


tccggcagac 


780 


Qaccaataac tatattcccrt 


tatcctggat tatggegaga 


acgatacctc 


ageggatgt t 


840 


cctcccctat acacgcccca 


aoctcrcfacftcrcr ctocrtacacc 

n 3333 a 3 L yy w>i-.yy L.vjw>yv**-> 


coaac cert 1 1 

^•33 w^j w w w 


ttccccrctat 


900 


cractacaaat ttaaaotcca 


cracttacccrc ttataccacc 


aaatcttaat 


attccacatc 


960 


tttaaaaaac ttaocaaaaa 


crccaaccicta crtcrt crcrcaaa 
y u<-yy<~.yuuy y L.y uyy uyy a 


fccrcaat tacra 

Lyuay i— i— y y c* 


at"accracoao 


1020 


l„# W L>» y y 3 V»y La La-Va-O La^*»a La 


y oy^y y y ^yy *-«c* i— ^»y y 


c- 1" t - a t" era a rrr~ 
wll.q uy aay u» 


a r*f . f *f *^t*t*^ ^ 

ayc*L*333^ww 


■A. IA -A< V 


at* h r*aaaar"t" t~arT , arT , ar , t~ 


acraattcaat tatactccat 


hhoapa t~ rna 
w wyy^uuuya 


aa r*a a caccc 


114 0 


1~ t" nnr^nr f t~ t" t~ t~ p I - apr"* 


hcrfarrfoaa hnoopnciatn 
i»y uyi«^Lyaa "sys*yy~"3 


a a na aratha 

aay aaU*aU LA 


f- pant* t*aat* 
Lway l. uyy u w 


1200 


rraf- t" t~ at" a fa aaaaaaar'at"' 


a c canac t" t" a t*t*ahat"raaa 


araahaarra 

a uaa uy auuu 


rfoarahtah 

w Lyy LtU La l. u l* 


1260 


^ML. U^H^^^H w ^wyyyywyQ 


^uwauwyya^ y y y g w wy ww u* 


a t~ a a r»a ar»t* a 


y yy ^y^^ 


1320 


{"•ft* r*at*at" rr» r*r i at~aaa r»t" a 

V<LLl>aLCll>UO L J LjOOU LaL* 


^*^yy aaL yyy aL y ( - L y ai -yy 


af rf na a t* an 
a tyactLyy 


a na e^ctctctt" a t~ 
ayauyiyy lciu 


1 *4 RO 

JLO O V" 


rhooaataah t*af , t*t*ar»aaa 


auuuyyyy i_ v_ yt_yyyyt_yt - .u 


a t" a ora t" rra a 
a uayua uy act 


r*rT*»cfcra t~ a a a 


1440 


aactcrcrtcea. othttcrfccrcc 


Cffh rr'acrar'a r^orraarcra 

y %«» i» w>uoy y wa wt*>y wwaawy y 


aat*t*r , t*tT' <**a 


LL>L>yLayyL<a 


1500 


raarl"t*cipra af afhan can 

way L_ l_ L»ywwa ClL^Ll.aL<L-^VJ 


a tT»aaat* t* t* a afrcrarhhact 
yyy o^wyout. Lyy 


t" t* a t* na t" errrt 
u LaLya L^yy 


rnnaananr 
La LLyaayay k_ 


-LOO U 


at* r* r*aat" t" t~ t~ at"af"r"aeraaa 


oycioy v,yyy u u w day >—y uy 


raf- rrf- <*»a a nt~ 
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yy y l lll 
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gatatcattt atgettcteg 


tcracCTctcta ctcatttacc 


aa aatT'tt'tr* 


paara a t* aar* 


1920 


tt tact crate CQcrtaccrcrot 


tccgctgcct gacggcgtgc 


ggtttgataa 


t ctgtgccgg 


1980 


ctgctgcctg ccgatatccg 


cgggttaggt gtggccagtc 


tggtgctgca 


tgtaccttac 


2040 


atggcccccc gcagttggaa 


attagatttc tttgeggega 


ageegtattt 


attgeaaacg 


2100 


gtcagcaaca atcttggagc 


ttccagctcg ttttggtacc 


gaagctccac 


ccaatattaa 


2160 


ctggatgaga aacaggegge 


ctcatcggct gtctccgctt 


tgcccttccc 


gataaaegtg 


2220 


gtateggata tgcacacggt 


ggacgaaatc agcggccgca 


ccaggactca 


gaagtatact 


2280 


taccgccatg gcgtgtatga 


ccggaccgaa aaggaatttg 


ceggattegg 


ccgcattgac 


2340 


acatgggaag aggageggga 


ttccgaagga accctgagcg 


tcagcactcc 


gcccgtgctg 


2400 


acgcggacct ggtatcatac 


egggcaaaag caggatgagg 


agcgtgccgt 


gcagcaatat 


2460 


tggcaaggcg accctgcggc 


ttttcaggtt aaacccgtcc 


ggcttactcg 


attcgatgeg 


2520 
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gcagcggccc aggatctgcc gctagattct aataatgggc agcaagaata ctggctgtac 2580 
cgatcattac aagggatgcc gctgcggact gagatttttg cgggagatgt tggcgggtcg 2640 
cctccttatc aggtagagag cttccgttat caagtgcgct tggtgcagag catcgattcg 2700 
gaatgtgttg ccttgcccat gcagttggag cagcttacgt acaactatga gcaaatcgcc 2760 
tctgatccgc agtgttcaca gcagatacag caatggttcg acgaatacgg cgtggcggca 2820 
cagagtgtaa caatccaata tccgcgccgg gcacagccgg aggacaatcc gtaccctcgc 2880 
acgctgccgg ataccagctg gagcagcagt tatgattcgc agcaaatgct gctgcggttg 2940 
accaggcaaa ggcaaaaagc gtaccacctt gcagatcctg aaggctggcg cttgaatatt 3000 
ccccatcaga cacgcctgga tgccttcatt tattctgctg acagcgtgcc cgccgaagga 3060 
ataagcgccg agctgctgga ggtggacggc acgttacgat cttcggcgct ggaacaggct 3120 

tatggcggcc agtcagagat catctatgcg ggcgggggcg aaccggattt gcgagccctg 3180 

gtccattaca ccagaagcgc ggttcttgat gaagactgtt tacaagccta tgaaggcgta 324 0 

ctgagcgata gccaattgaa ctcgcttctt gcctcttccg gctatcaacg aagcgcaaga 3300 

atattgggtt cgggcgatga agtggatatt tttgtcgcgg aacaaggatt tacccgttat 3360 

gcggatgaac cgaatttttt ccgtattctg gggcaacaat cctctctctt gtccggggaa 3420 

caagtattaa catgggatga taatttctgt gcggttacat ccatcgaaga cgcgcttggc 3480 

aatcaaattc agattgcata tgattaccgc tttgtggagg ccatccagat taccgatacg 354 0 

aataataatg tgaatcaggt cgccctggat gctctcggcc gggtcgtata cagccggacc 3600 

tggggcacgg aggaagggat aaagaccggc ttccgcccgg aggtggaatt cgcgacgccc 3660 

gagacaatgg agcaggcgct tgccctggca tctcccttgc cggttgcatc ctgctgtgta 3720 

tatgatgcgc atagctggat gggaacgata actcttgcac aactgtcaga gcttgttcca 3780 

gatagtgaaa agcaatggtc gttcttgata gacaatcgct tgattatgcc ggacggcaga 384 0 

atcagatccc gcggtcggga tccatggtcg cttcaccggc tattgccgcc tgctgtgggc 3900 ' 

gaattgctga gcgaggcgga ccgtaaaccg ccgcatacgg taattttggc agcagatcgt 3960 

tacccggatg acccatccca gcaaattcag gcgagcatcg tgtttagcga tggctttggg 4020 

cgtacgatac aaactgctaa aagagaagat acccgatggg cgattgcgga acgggtggac 4 080 

tatgacggaa ccggagccgt aatccgcagc tttcagcctt tttatcttga cgactggaat 4140 

tatgtgggcg aagaggctgt cagcagctct atgtacgcaa cgatctatta ttatgatgct 4200 

ctggcacgac aattaaggat ggtcaacgct aaaggatatg agaggagaac tgctttttac 4260 

ccatggttta cagtaaacga agatgaaaat gataccatgg actcatcatt atttgcttca 4320 
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ccgcctgcgc ggtga 4335 



<210> 11 

<211> 1444 

<212> PRT 

<213> Paenibacillus strain IDAS 1529 

<400> 11 

Met Pro Gin Ser Ser Asn Ala Asp He Lys Leu Leu Ser Pro Ser Leu 
1 5 10 15 



Pro Lys Gly Gly Gly Ser Met Lys Gly He Glu Glu Asn He Ala Ala 
20 25 30 



Pro Gly Ser Asp Gly Met Ala Arg Cys Asn Val Pro Leu Pro Val Thr 
35 ~ 40 45 



Ser Gly Arg Tyr He Thr Pro Asp He Ser Leu Ser Tyr Ala Ser Gly 
50 55 60 



His Gly Asn Gly Ala Tyr Gly Met Gly Trp Thr Met Gly Val Met Ser 
65 70 75 80 



He Ser Arg Arg Thr Ser Arg Gly Thr Pro Ser Tyr Thr Ser Glu Asp 
85 90 95 



Gin Phe Leu Gly Pro Asp Gly Glu Val Leu Val Pro Glu Ser Asn Glu 
100 105 110 



Gin Gly Glu He He Thr Arg His Thr Asp Thr Ala Gin Gly He Pro 
115 120 125 



Leu Gly Glu Thr Phe Thr Val Thr Arg Tyr Phe Pro Arg He Glu Ser 
130 135 140 



Ala Phe His Leu Leu Glu Tyr Trp Glu Ala Gin Ala Gly Ser Ala Thr 
145 150 155 160 



Ala Ser Phe Trp Leu He His Ser Ala Asp Gly Val Leu His Cys Leu 
165 170 " 175 



Gly Lys Thr Ala Gin Ala Arg He Ala Ala Pro Asp Asp Ser Ala Lys 
180 185 190 



He Ala Glu Trp Leu Val Glu Glu Ser Val Ser Pro Phe Gly Glu His 
195 200 205 



WO 2004/002223 PCT/US2003/020082 



51 

He Tyr Tyr Gin Tyr Lys Glu Glu Asp Asn Gin Gly Val Asn Leu Glu 
210 215 220 

Glu Asp Asn His Gin Tyr Gly Ala Asn Arg Tyr Leu Lys Ser He Arq 
225 230 235 240 

Tyr Gly Asn Lys Val Ala Ser Pro Ser Leu Tyr Val Trp Lys Gly Glu 
245 250 255 

He Pro Ala Asp Gly Gin Trp Leu Tyr Ser Val He Leu Asp Tyr Gly 
260 265 270 

Glu Asn Asp Thr Ser Ala Asp Val Pro Pro Leu Tyr Thr Pro Gin Glv 
275 280 285 

Glu Trp Leu Val Arg Pro Asp Arg Phe Ser Arg Tyr Asp Tyr Gly Phe 
290 295 300 

Glu Val Arg Thr Cys Arg Leu Cys Arg Gin Val Leu Met Phe His Val 
305 310 315 320 

Phe Lys Glu Leu Gly Gly Glu Pro Ala Leu Val Trp Arg Met Gin Leu 
325 330 335 

Glu Tyr Asp Glu Asn Pro Ala Ala Ser Met Leu Ser Ala Val Arg Gin 
340 345 350 

Leu Ala Tyr Glu Ala Asp Gly Ala He Arg Ser Leu Pro Pro Leu Glu 
355 360 3 6 5 

Phe Asp Tyr Thr Pro Phe Gly He Glu Thr Thr Ala Asp Trp Gin Pro 
370 375 380 

Phe Leu Pro Val Pro Glu Trp Ala Asp Glu Glu His Tyr Gin Leu Val 
385 390 395 4Q0 

Asp Leu Tyr Gly Glu Gly He Pro Gly Leu Leu Tyr Gin Asn Asn Asp 
405 410 415 

His Trp His Tyr Arg Ser Pro Ala Arg Gly Asp Thr Pro Asp Gly He 
420 425 430 



Ala Tyr Asn Ser Trp Arg Pro Leu Pro His lie Pro Val Asn Ser 
435 440 445 
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Asn Gly Met Leu Met Asp Leu Asn Gly Asp Gly Tyr Leu Glu Trp Leu 
450 455 460 



Leu Ala Glu Pro Gly Val Ala Gly Arg Tyr Ser Met Asn Pro Asp Lys 
465 470 475 480 



Ser Trp Ser Gly Phe Val Pro Leu Gin Ala Leu Pro Thr Glu Phe Phe 
485 490 495 



His Pro Gin Ala Gin Leu Ala Asn val Thr Gly Ser Gly Leu Thr Asp 
500 505 510 



Leu Val Met He Gly Pro Lys Ser Val Arg Phe Tyr Ala Gly Glu Glu 
515 520 525 



Ala Gly Phe Lys Arg Ala Cys Glu Val Trp Gin Gin Val Gly He Thr 
530 535 540 



Leu Pro Val Glu Arg Val Asp Lys Lys Glu Leu Val Ala Phe Ser Asp 
545 550 555 560 

Met Leu Gly Ser Gly Gin Ser His Leu Val Arg He Arg His Asp Gly 
565 570 575 



Val Thr Cys Trp Pro Asn Leu Gly Asn Gly Val Phe Gly Ala Pro Leu 
580 585 590 



Ala Leu His Gly Phe Thr Ala Ser Glu Arg Glu Phe Asn Pro Glu Arg 
595 600 605 



Val Tyr Leu Val Asp Leu Asp Gly Ser Gly Ala Ser Asp He He Tyr 
610 615 620 



Ala Ser Arg Asp Ala Leu Leu He Tyr Arg Asn Leu Ser Gly Asn Gly 

625 ~ 630 635 64 0 

Phe Ala Asp Pro Val Arg Val Pro Leu Pro Asp Gly Val Arg Phe Asp 

645 650 655 



Asn Leu Cys Arg Leu Leu Pro Ala Asp He Arg Gly Leu Gly Val Ala 
660 665 670 



Ser Leu Val Leu His Val Pro Tyr Met Ala Pro Arg Ser Trp Lys Leu 
675 680 685 



Asp Phe Phe Ala Ala Lys Pro Tyr Leu Leu Gin Thr Val Ser Asn Asn 
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690 695 700 



Leu Gly Ala Ser Ser Ser Phe Trp Tyr Arg Ser Ser Thr Gin Tyr Trp 
705 710 715 720 



Leu Asp Glu Lys Gin Ala Ala Ser Ser Ala Val Ser Ala Leu Pro Phe 
725 730 735 



Pro He Asn Val Val Ser Asp Met His Thr Val Asp Glu He Ser Gly 
740 745 750 



Arg Thr Arg Thr Gin Lys Tyr Thr Tyr Arg His Gly Val Tyr Asp Arg 
755 760 765 



Thr Glu Lys Glu Phe Ala Gly Phe Gly Arg He Asp Thr Trp Glu Glu 
770 775 780 



Glu Arg Asp Ser Glu Gly Thr Leu Ser Val Ser Thr Pro Pro Val Leu 
785 790 795 800 



Thr Arg Thr Trp Tyr His Thr Gly Gin Lys Gin Asp Glu Glu Arg Ala 
805 810 815 



Val Gin Gin Tyr Trp Gin Gly Asp Pro Ala Ala Phe Gin Val Lys Pro 
820 825 830 



Val Arg Leu Thr Arg Phe Asp Ala Ala Ala Ala Gin Asp Leu Pro Leu 
835 840 845 



Asp Ser Asn Asn Gly Gin Gin Glu Tyr Trp Leu Tyr Arg Ser Leu Gin 
850 855 " 860 



Gly Met Pro Leu Arg Thr Glu He Phe Ala Gly Asp Val Gly Gly Ser 
865 870 875 880 



Pro Pro Tyr Gin Val Glu Ser Phe Arg Tyr Gin Val Arg Leu Val Gin 
885 890 895 



Ser He Asp Ser Glu Cys Val Ala Leu Pro Met Gin Leu Glu Gin Leu 
900 905 910 



Thr Tyr Asn Tyr Glu Gin He Ala Ser Asp Pro Gin Cys Ser Gin Gin 
915 920 925 



lie Gin Gin Trp Phe Asp Glu Tyr Gly Val Ala Ala Gin Ser Val Thr 
930 935 940 
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lie Gin Tyr Pro Arg Arg Ala Gin Pro Glu Asp Asn Pro Tyr Pro Arg 
945 950 955 960 



Thr Leu Pro Asp Thr Ser Trp Ser Ser Ser Tyr Asp Ser Gin Gin Met 
965 970 975 



Leu Leu Arg Leu Thr Arg Gin Arg Gin Lys Ala Tyr His Leu Ala Asp 
980 985 990 



Pro Glu Gly Trp Arg Leu Asn He Pro His Gin Thr Arg Leu Asp Ala 
995 1000 1005 



Phe He Tyr Ser Ala Asp Ser Val Pro Ala Glu Gly He Ser Ala 
1010 1015 1020 



Glu Leu Leu Glu Val Asp Gly Thr Leu Arg Ser Ser Ala Leu Glu 
1025 1030 1035 



Gin Ala Tyr Gly Gly Gin Ser Glu He He Tyr Ala Gly Gly Gly 
1040 1045 1050 



Glu Pro Asp Leu Arg Ala Leu Val His Tyr Thr Arg Ser Ala Val 
1055 1060 1065 



Leu Asp Glu Asp Cys Leu Gin Ala Tyr Glu Gly Val Leu Ser Asp 
1070 1075 1080 



Ser Gin Leu Asn Ser Leu Leu Ala Ser Ser Gly Tyr Gin Arg Ser 
1085 1090 1095 



Ala Arg He Leu Gly Ser Gly Asp Glu Val Asp He Phe Val Ala 
1100 1105 1110 



Glu Gin Gly Phe Thr Arg Tyr Ala Asp Glu Pro Asn Phe Phe Arg 
1115 1120 1125 



He Leu Gly Gin Gin Ser Ser Leu Leu Ser Gly Glu Gin Val Leu 
1130 1135 H40 



Thr Trp Asp Asp Asn Phe Cys Ala Val Thr Ser He Glu Asp Ala 
1145 " 1150 1155 



Leu Gly Asn Gin He Gin He Ala Tyr Asp Tyr Arg Phe Val Glu 
1160 1165 1170 
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Ala He Gin He Thr Asp Thr Asn Asn Asn Val Asn Gin Val Ala 
1175 H80 H85 



Leu Asp Ala Leu Gly Arg Val Val Tyr Ser Arg Thr Trp Gly Thr 
1190 H95 1200 



Glu Glu Gly He Lys Thr Gly Phe Arg Pro Glu Val Glu Phe Ala 
1205 1210 1215 



Thr Pro Glu Thr Met Glu Gin * Ala Leu Ala Leu Ala Ser Pro Leu 
1220 1225 1230 



Pro Val Ala Ser Cys Cys Val Tyr Asp Ala His Ser Trp Met Gly 
1235 1240 1245 



Thr He Thr Leu Ala Gin Leu Ser Glu Leu Val Pro Asp Ser Glu 
1250 1255 1260 



Lys Gin Trp Ser Phe Leu He Asp Asn Arg Leu He Met Pro Asp 
12 65 1270 1275 



Gly Arg He Arg Ser Arg Gly Arg Asp Pro Trp Ser Leu His Arg 
1280 1285 1290 



Leu Leu Pro Pro Ala Val Gly Glu Leu Leu Ser Glu Ala Asp Arg 
1295 1300 1305 



Lys Pro Pro His Thr Val He Leu Ala Ala Asp Arg Tyr Pro Asp 
1310 1315 1320 



Asp Pro Ser Gin Gin He Gin Ala Ser He Val Phe Ser Asp Gly 
1325 1330 1335 



Phe Gly Arg Thr He Gin Thr Ala Lys Arg Glu Asp Thr Arg Trp 
1340 1345 1350 



Ala He Ala Glu Arg Val Asp Tyr Asp Gly Thr Gly Ala Val He 
1355 1360 & 1365 



Arg Ser Phe Gin Pro Phe Tyr Leu Asp Asp Trp Asn Tyr Val Gly 
1370 1375 1380 



Glu Glu Ala Val Ser Ser Ser Met Tyr Ala Thr He Tyr Tyr Tyr 
1385 1390 * 1395 
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Asp Ala Leu Ala Arg Gin Leu Arg Met Val Asn Ala Lys Gly Tyr 
1400 1405 1410 



Glu Arg Arg Thr Ala Phe Tyr Pro Tip Phe Thr Val Asn Glu Asp 
1415 1420 1425 



Glu Asn Asp Thr Met Asp Ser Ser Leu Phe Ala Ser Pro Pro Ala 
1430 1435 1440 



Arg 



<210> 12 

<211> 2793 

<212> DNA 

<213> Paenibacillus strain IDAS 1529 



<400> 12 



dLyadLciuaa 




i« ay yy y v~av~y 




way l. uy Lyya 


uaai_L.y yactb 




t* t~ on a era t* t* n 


rrpa t* t* c t" t* 

y l»a L. L. t» L> L. I* CI 


at* at*a3 r»pert~ 


at* cci enact a 


aaya u^.^.y y l» 


ay a uy ay uy l 


12 0 




a ra pert - a t* a p 


a p not" t~ a a err* 


t" a t" c t~ t acr p a 


y way i«a tyya 


t - rrrnah t~ t~a 

i>^<^^i<yi>i> y 


180 


ttctcgcaat 


atcaggatga 


tegeggaaca 


ccgccgaata 


tacgaaccat 


ggcttccctg 


240 


agaggegaag 


cgctgtgttc 


ggaaagtgtg 


gatgccggcc 


geaaggegga 


gctttttgat 


300 


atcgaggggc 


ggcccgtctg 


gcttatcgat 


gccaacggca 


cagagacgac 


tctcgaatat 


360 


gatgtcttag 


gcaggccaac 


ageegtatte 


gagcaacagg 


aaggtacgga 


ctccccccag 


420 


tgeagggage 


ggtttattta 


tggtgagaag 


gaggeggatg 


cccaggccaa 


caatttgege 


480 


ggacaactgg 


ttcgccacta 


cgataccgcg 


ggceggatae 


agaccgacag 


catctccttg 


54 0 


gctggactgc 


cgttgcgcca 


aagccgtcaa 


ctgctgaaaa 


attgggatga 


acctggcgac 


600 


tggagtatgg 


atgaggaaag 


cgcctgggcc 


tcgttgctgg 


ctgccgaagc 


ttatgatacg 


660 


agctggcggt 


atgacgegea 


ggacagggtg 


ctcgcccaaa 


ccgacgccaa 


agggaatctc 


720 


cagcaactga 


cttacaatga 


cgccggccag 


ccgcaggcgg 


tcagcctcaa 


getgeaagge 


7B0 


caageggage 


aacggatttg 


gaaceggate 


gagtacaacg 


eggegggtea 


agtggatctc 


840 


gccgaagccg 


ggaatggaat 


cgtaacggaa 


tatacttacg 


aggaaagcac 


geageggtta 


900 


atccgaaaaa 


aagattcccg 


cggactgtcc 


teeggggaaa 


gagaagtget 


gcaggattat 


960 


cgttatgaat 


atgatccggt 


aggcaatatc 


ctttctattt 


acaatgaagc 


ggagccggtt 


1020 


cgttatttcc 


gcaatcaggc 


cgttgctccg 


aaaaggcaat 


atgectaega 


tgccttgtat 


1080 


cagcttgtat 


ctagttcggg 


gegggaatec 


gacgcgcttc 


ggcagcagac 


gtcgcttcct 


1140 


cccttgatca 


cgcctatccc 


tctggacgat 


agecaatacg 


teaattaege 


tgaaaaatac 


1200 
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agctatgatc aggcgggcaa tttaatcaag cttagccata acggggcaag tcaatataca 1260 

acgaatgtgt atgtggacaa aagctcaaac cgggggattt ggcggcaagg ggaagacatc 1320 

ccggatatcg cggcttcctt tgacagagca ggcaatcaac aagctttatt cccggggaga 1380 

ccgttggaat gggatacacg caatcaatta agccgtgtcc atatggtcgt gcgcgaaggc 1440 

ggagacaacg actgggaagg ctatctctat gacagctcgg gaatgcgtat cgtaaaacga 1500 

tctacccgca aaacacagac aacgacgcaa acggatacga ccctctattt gccgggcctg 1560 

gagctgcgaa tccgccagac cggggaccgg gtcacggaag cattgcaggt cattaccgtg 1620 

gatgagggag cgggacaagt gagggtgctg cactgggagg atggaaccga gccgggcggc 1680 

atcgccaatg atcagtaccg gtacagcctg aacgatcatc ttacctcctc tttattggaa 1740 

gttgacgggc aaggtcagat cattagtaag gaagaatttt atccctatgg cggcacagcc 1800 

ctgtggacag cccggtcaga ggtagaggca agctacaaga ccatccgcta ttcaggcaaa 1860 

gagcgggatg ccacaggcct gtattattac ggacaccgct actatatgcc atggttgggt 1920 

cgctggctga atccggaccc ggccggaatg gtagatggac taaacctgta ccgtatggtc 1980 

aggaacaatc ctataggact gatggatccg aatgggaatg cgccaatcaa cgtggcggat 2040 

tatagcttcg tgcatggtga tttagtttat ggtcttagta aggaaagagg aagatatcta 2100 

aagctattta atccaaactt taatatggaa aaatcagact ctcctgctat ggttatagat 2160 

caatataata ataatgttgc attgagtata actaaccaat ataaagtaga agaattgatg 2220 

aaatttcaaa aagacccaca aaaagccgca cggaaaataa aggttccaga agggaatcgt 2280 

ttatcgagga acgaaaatta tcctttgtgg cacgattata ttaacattgg agaagctaaa 2340 

gctgcattta aggcctctca tattttccaa gaagtgaagg ggaattatgg gaaagattat 2400 

tatcataaat tattattaga cagaatgata gaatcgccgt tgctgtggaa acgaggcagc 2460 

aaactcgggc tagaaatcgc cgctaccaat cagagaacaa aaatacactt tgttcttgac 2520 

aatttaaata tcgagcaggt ggttacgaaa gagggtagcg gcggtcagtc aatcacagct 2580 

tcggagctcc gttatattta tcgaaatcgc gaaagattga acgggcgtgt cattttctat 2640 

agaaataatg aaaggctaga tcaggctcca tggcaagaaa atccggactt atggagcaaa 2700 

tatcaaccgg gtcttagaca aagcagcagt tcaagagtca aagaacgagg gattgggaac 2760 

tttttccgcc ggttttcaat gaagagaaag tag 2793 

<210> 13 
<211> 930 
<212> PRT 

<213> Paenibacillus strain IDAS 1529 
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<400> 13 

Met Asn Thr Thr Ser He Tyr Arg Gly Thr Pro Thr lie Ser Val Val 
1 5 10 15 



Asp Asn Arg Asn Leu Glu He Arg He Leu Gin Tyr Asn Arg He Ala 
20 25 30 



Ala Glu Asp Pro Ala Asp Glu Cys He Leu Arg Asn Thr Tyr Thr Pro 
35 40 45 



Leu Ser Tyr Leu Gly Ser Ser Met Asp Pro Arg Leu Phe Ser Gin Tyr 
50 55 60 



Gin Asp Asp Arg Gly Thr Pro Pro Asn He Arg Thr Met Ala Ser Leu 
65 70 75 80 



Arg Gly Glu Ala Leu Cys Ser Glu Ser Val Asp Ala Gly Arg Lys Ala 
85 90 95 



Glu Leu Phe Asp He Glu Gly Arg Pro Val Trp Leu He Asp Ala Asn 
100 105 no 



Gly Thr Glu Thr Thr Leu Glu Tyr Asp Val Leu Gly Arg Pro Thr Ala 
115 120 125 



Val Phe Glu Gin Gin Glu Gly Thr Asp Ser Pro Gin Cys Arg Glu Arg 
130 135 140 



Phe He Tyr Gly Glu Lys Glu Ala Asp Ala Gin Ala Asn Asn Leu Arg 
145 150 155 160 



Gly Gin Leu Val Arg His Tyr Asp Thr Ala Gly Arg He Gin Thr Asp 
165 170 175 



Ser He Ser Leu Ala Gly Leu Pro Leu Arg Gin Ser Arg Gin Leu Leu 
180 185 190 



Lys Asn Trp Asp Glu Pro Gly Asp Trp Ser Met Asp Glu Glu Ser Ala 
195 200 205 



Trp Ala Ser Leu Leu Ala Ala Glu Ala Tyr Asp Thr Ser Trp Arg Tyr 
210 215 220 



Asp Ala Gin Asp Arg Val Leu Ala Gin Thr Asp Ala Lys Gly Asn Leu 
225 230 235 240 
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Gin Gin Leu Thr Tyr Asn Asp Ala Gly Gin Pro Gin Ala Val Ser Leu 
245 250 255 



Lys Leu Gin Gly Gin Ala Glu Gin Arg lie Trp Asn Arg He Glu Tyr 
260 265 270 



Asn Ala Ala Gly Gin Val Asp Leu Ala Glu Ala Gly Asn Gly He Val 
275 280 285 



Thr Glu Tyr Thr Tyr Glu Glu Ser Thr Gin Arg Leu He Arg Lys Lys 
290 295 300 



Asp Ser Arg Gly Leu Ser Ser Gly Glu Arg Glu Val Leu Gin Asp Tyr 
3°5 310 315 320 



Arg Tyr Glu Tyr Asp Pro Val Gly Asn He Leu Ser He Tyr Asn Glu 
325 330 335 



Ala Glu Pro Val Arg Tyr Phe Arg Asn Gin Ala Val Ala Pro Lys Arg 
340 345 350 



Gin Tyr Ala Tyr Asp Ala Leu Tyr Gin Leu Val Ser Ser Ser Gly Arg 
355 360 365 



Glu Ser Asp Ala Leu Arg Gin Gin Thr Ser Leu Pro Pro Leu He Thr 
370 375 380 



Pro He Pro Leu Asp Asp Ser Gin Tyr Val Asn Tyr Ala Glu Lys Tyr 
385 390 395 400 



Ser Tyr Asp Gin Ala Gly Asn Leu He Lys Leu Ser His Asn Gly Ala 
405 410 415 



Ser Gin Tyr Thr Thr Asn Val Tyr Val Asp Lys Ser Ser Asn Arg Gly 
420 425 430 



He Trp Arg Gin Gly Glu Asp He Pro Asp He Ala Ala Ser Phe Asp 
435 440 445 



Arg Ala Gly Asn Gin Gin Ala Leu Phe Pro Gly Arg Pro Leu Glu Trp 
450 455 460 



Asp Thr Arg Asn Gin Leu Ser Arg Val His Met Val Val Arg Glu Gly 
465 470 475 480 
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Gly Asp Asn Asp Trp Glu Gly Tyr Leu Tyr Asp Ser Ser Gly Met Arg 
485 490 495 



He Val Lys Arg Ser Thr Arg Lys Thr Gin Thr Thr Thr Gin Thr Asp 
500 505 510 



Thr Thr Leu Tyr Leu Pro Gly Leu Glu Leu Arg He Arg Gin Thr Gly 
515 520 525 



Asp Arg Val Thr Glu Ala Leu Gin Val He Thr Val Asp Glu Gly Ala 
530 535 540 



Gly Gin Val Arg Val Leu His Trp Glu Asp Gly Thr Glu Pro Gly Gly 
545 550 555 560 



He Ala Asn Asp Gin Tyr Arg Tyr Ser Leu Asn Asp His Leu Thr Ser 
565 570 575 



Ser Leu Leu Glu Val Asp Gly Gin Gly Gin He He Ser Lys Glu Glu 
580 585 590 



Phe Tyr Pro Tyr Gly Gly Thr Ala Leu Trp Thr Ala Arg Ser Glu Val 
595 600 605 



Glu Ala Ser Tyr Lys Thr He Arg Tyr Ser Gly Lys Glu Arg Asp Ala 
610 615 620 



Thr Gly Leu Tyr Tyr Tyr Gly His Arg Tyr Tyr Met Pro Trp Leu Gly 
625 630 635 640 



Arg Trp Leu Asn Pro Asp Pro Ala Gly Met Val Asp Gly Leu Asn Leu 
645 650 655 



Tyr Arg Met Val Arg Asn Asn Pro lie Gly Leu Met Asp Pro Asn Gly 
660 665 670 



Asn Ala Pro He Asn Val Ala Asp Tyr Ser Phe Val His Gly Asp Leu 
675 680 685 



Val Tyr Gly Leu Ser Lys Glu Arg Gly Arg Tyr Leu Lys Leu Phe Asn 
690 695 700 



Pro Asn Phe Asn Met Glu Lys Ser Asp Ser Pro Ala Met Val He Asp 
705 710 715 720 



Gin Tyr Asn Asn Asn Val Ala Leu Ser 



He Thr Asn Gin Tyr Lys Val 
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725 730 735 



Glu Glu Leu Met Lys Phe Gin Lys Asp Pro Gin Lys Ala Ala Arg Lys 
740 745 * 750 



He Lys Val Pro Glu Gly Asn Arg Leu Ser Arg Asn Glu Asn Tyr Pro 
755 760 765 



Leu Trp His Asp Tyr He Asn He Gly Glu Ala Lys Ala Ala Phe Lys 
770 775 780 



Ala Ser His He Phe Gin Glu Val Lys Gly Asn Tyr Gly Lys Asp Tyr 
7 B5 790 795 ~ BOO 



Tyr His Lys Leu Leu Leu Asp Arg Met He Glu Ser Pro Leu Leu Trp 
805 810 815 



Lys Arg Gly Ser Lys Leu Gly Leu Glu He Ala Ala Thr Asn Gin Arg 
820 825 830 



Thr Lys He His Phe Val Leu Asp Asn Leu Asn He Glu Gin Val Val 
835 840 845 



Thr Lys Glu Gly Ser Gly Gly Gin Ser lie Thr Ala Ser Glu Leu Arg 
850 855 860 



Tyr He Tyr Arg Asn Arg Glu Arg Leu Asn Gly Arg Val He Phe Tyr 
865 870 875 880 



Arg Asn Asn Glu Arg Leu Asp Gin Ala Pro Trp Gin Glu Asn Pro Asp 
885 890 895 



Leu Trp Ser Lys Tyr Gin Pro Gly Leu Arg Gin Ser Ser Ser Ser Arg 
900 905 910 



Val Lys Glu Arg Gly He Gly Asn Phe Phe Arg Arg Phe Ser Met Lys 
915 920 925 



Arg Lys 
930 



<210> 14 

<211> 1791 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Nucleic acid sequence of ORF7, which encodes a cry-like protein - 
<400> 14 



atgaactcaa 


atgaaccaaa 


tttatctgat 


gttgttaatt 


gtttaagtga 


ccccaatagt 


60 


gacttggaga 


agtctggcgg 


tggagtagcg 


ctagatgttg 


gaatgtcatt 


gatatccgaa 


120 


cttcttggta 


cggttccagt 


tgctggatca 


attcttcaat 


ttgtattcga 


taaattgtgg 


180 


tttatttttg 


gcccttctga 


gtgggactca 


cttatggaac 


atgttgaagc 


attaattgat 


240 


agtaaaatac 


aagagcaggt 


aaaaagaagt 


gcacaagatg 


aactaaatgc 


aattacaaat 


300 


aacttatcta 


cgtatttgaa 


atttctagat 


gcatgggaaa 


atgattctaa 


taatttaaga 


360 


gcgagagctg 


tagtgaaaga 


ccaatttgta 


ggccttgaac 


agactcttga 


aagaaaaatg 


420 


gttagtgttt 


ttggaagtac 


gggtcatgaa 


gtgcatcttt 


tgccaatttt 


cgctcaagca 


480 


gccaacctcc 


acctaattct 


attaagagat 


gctgagaaat 


atggaaagag 


atggggttgg 


540 


gcagatagag 


aaattcaagt 


atattatgat 


aaccagattc 


gttatatcca 


tgaatatacg 


600 


gaccattgta 


ttaaatatta 


taatcaagga 


ttaagtaaac 


tgaaaggttc 


tacctatcaa 


660 


gattgggata 


agtataatcg 


ttttagaaga 


gaaatgaccc 


taactgttct 


tgatttgatt 


720 


tcaattttcc 


catcgtatga 


tactagaact 


tacccaattg 


atacaatagg 


tcaattgaca 


780 


agggaagttt 


attcggattt 


acttattgct 


aacccgtctg 


ggatgcagac 


tttcactaat 


840 


gtagatttcg 


acaatattct 


tattagaaaa 


cctcatttaa 


tggatttctt 


aagaactctt 


900 


gagattttta 


ccgatcgaca 


taacgcaagc 


agacacaacg 


tatattgggg 


cggacatcga 


960 


gtgcattctt 


cttacacagg 


aggtaatttt 


gaaaattttg 


aatctccctt 


atatggcagt 


1020 


gaagcaaatg 


tagaaccccg 


aacatggttg 


agttttggag 


aatctcaagt 


ctataatata 


1080 


cgttcgaagc 


ctgagtggga 


tagaggaagt 


actgcaatta 


gtggctccta 


tgaatttcga 


1140 


ggagtgacag 


gatgttcttt 


ttatcgaatg 


ggaaattttg 


ctggcaccgt 


agccctaact 


1200 


taccgacagt 


ttggtaacga 


aggttctcaa 


atcccattgc 


acaggctatg 


tcatgttact 


1260 


tattttagaa 


gatctcaagc 


tgtgggggcg 


acttcgagac 


agacgttaac 


aagtggtccg 


1320 


ctattttcct 


ggacacatag 


tagtgctacg 


gaaacgaata 


tcattcaccc 


gacaaaaatt 


1380 


acacaaatac 


caatggtgaa 


ggctagttcc 


cttggatcag 


gtacttctgt 


tgtccaagga 


1440 


ccaggcttta 


caggagggga 


tgtacttcga 


agaaatagcc 


ccggtagcac 


aggaacttta 


1500 


agagttaacg 


tcaattcacc 


attatcacag 


agatatcgta 


taagaattcg 


ttacgcttct 


1560 


actacggatt 


tagatttttt 


tgtcattcgc 


ggaaatacga 


cagttaataa 


ttttagattt 


1620 


gggaacacta 


tgcgtaaagg 


agaccctata 


acctctcgat 


catttagatt 


tgcggctttt 


1680 


agtacaccat 


ttacttttgc 


tagctcacag 


gatgaactta 


gaataaatgt 


acaaaatttc 


1740 
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aataatggtg aagaagttta tatagataga atcgaagtta ttccagtttg a 1791 

<210> 15 

<211> 596 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Amino acid sequence encoded by ORF7. 

<400> 15 

Met Asn Ser Asn Glu Pro Asn Leu Ser Asp Val Val Asn Cys Leu Ser 
1.5 10 15 

Asp Pro Asn Ser Asp Leu Glu Lys Ser Gly Gly Gly Val Ala Leu Asp 
20 25 - 30 

Val Gly Met Ser Leu He Ser Glu Leu Leu Gly Thr Val Pro Val Ala 
35 40 45 

Gly Ser He Leu Gin Phe Val Phe Asp Lys Leu Trp Phe He Phe Gly 
50 55 so 

Pro Ser Glu Trp Asp Ser Leu Met Glu His Val Glu Ala Leu He Asp 
65 70 75 80 

Ser Lys He Gin Glu Gin Val Lys Arg Ser Ala Gin Asp Glu Leu Asn 
85 90 95 

Ala He Thr Asn Asn Leu Ser Thr Tyr Leu Lys Phe Leu Asp Ala Trp 
100 105 no 

Glu Asn Asp Ser Asn Asn Leu Arg Ala Arg Ala Val Val Lys Asp Gin 
I 15 120 125 

Phe Val Gly Leu Glu Gin Thr Leu Glu Arg Lys Met Val Ser Val Phe 
130 ' 135 140 

Gly Ser Thr Gly His Glu Val His Leu Leu Pro He Phe Ala Gin Ala 
145 150 155 160 

Ala Asn Leu His Leu He Leu Leu Arg Asp Ala Glu Lys Tyr Gly Lys 
165 170 i7 5 

Arg Trp Gly Trp Ala Asp Arg Glu He Gin Val Tyr Tyr Asp Asn Gin 
180 185 190 



WO 2004/002223 



PCT7US2003/020082 



64 

He Arg Tyr He His Glu Tyr Thr Asp His Cys He Lys Tyr Tyr Asn 
195 200 205 



Gin Gly Leu Ser Lys Leu Lys Gly Ser Thr Tyr Gin Asp Trp Asp Lys 
210 215 220 



Tyr Asn Arg Phe Arg Arg Glu Met Thr Leu Thr Val Leu Asp Leu He 
225 230 235 240 



Ser He Phe Pro Ser Tyr Asp Thr Arg Thr Tyr Pro He Asp Thr lie 
245 250 255 



Gly Gin Leu Thr Arg Glu Val Tyr Ser Asp Leu Leu He Ala Asn Pro 
260 265 270 



Ser Gly Met Gin Thr Phe Thr Asn Val Asp Phe Asp Asn He Leu He 
275 280 285 



Arg Lys Pro His Leu Met Asp Phe Leu Arg Thr Leu Glu He Phe Thr 
290 295 300 



Asp Arg His Asn Ala Ser Arg His Asn Val Tyr Trp Gly Gly His Arg 
305 310 315 320 



Val His Ser Ser Tyr Thr Gly Gly Asn Phe Glu Asn Phe Glu Ser Pro 
325 330 335 



Leu Tyr Gly Ser Glu Ala Asn Val Glu Pro Arg Thr Trp Leu Ser Phe 
340 345 350 



Gly Glu Ser Gin Val Tyr Asn He Arg Ser Lys Pro Glu Trp Asp Arg 
355 360 365 



Gly Ser Thr Ala He Ser Gly Ser Tyr Glu Phe Arg Gly Val Thr Gly 
370 375 380 



Cys Ser Phe Tyr Arg Met Gly Asn Phe Ala Gly Thr Val Ala Leu Thr 
385 390 395 400 



Tyr Arg Gin Phe Gly Asn Glu Gly Ser Gin He Pro Leu His Arg Leu 
405 410 415 



Cys His Val Thr Tyr Phe Arg Arg Ser Gin Ala Val Gly Ala Thr Ser 
420 425 430 



Arg Gin Thr Leu Thr Ser Gly Pro Leu Phe Ser Trp Thr His Ser Ser 
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435 440 445 

Ala Thr Glu Thr Asn lie lie His Pro Thr Lys He Thr Gin He Pro 
450 455 460 



Met Val Lys Ala Ser Ser Leu Gly Ser Gly Thr Ser Val Val Gin Gly 
465 470 475 480 

Pro Gly Phe Thr Gly Gly Asp Val Leu Arg Arg Asn Ser Pro Gly Ser 
485 490 495 

Thr Gly Thr Leu Arg Val Asn Val Asn Ser Pro Leu Ser Gin Arg Tyr 
500 505 510 

Arg He Arg He Arg Tyr Ala Ser Thr Thr Asp Leu Asp Phe Phe Val 
515 520 525 

He Arg Gly Asn Thr Thr Val Asn Asn Phe Arg Phe Gly Asn Thr Met 
530 535 540 



Arg Lys Gly Asp Pro He Thr Ser Arg Ser Phe Arg Phe Ala Ala Phe 
545 550 555 560 

Ser Thr Pro Phe Thr Phe Ala Ser Ser Gin Asp Glu Leu Arg He Asn 
565 570 575 

Val Gin Asn Phe Asn Asn Gly Glu Glu Val Tyr lie Asp Arg He Glu 
580 585 * 590 



Val lie Pro Val 
595 



<210> 16 
<211> 1547 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Nucleic acid sequence of the 16S rDNA of IDAS1529. 
<400> 16 

tggagagttt gatcctggct caggacgaac gctggcggcg tgcctaatac atgcaagtcg 60 
agcggakcaa cggtttcctt cgggaaaccr ttagcttagc ggcggacggg tgagtaatac 120 
gtaggtaacc tgcccttaag accgggataa ctcacggaaa cgtgggctaa taccggatag ieo 
gcgatttcct cgcatgaggg aatcgggaaa ggcggagcaa tctgccactt atggatggac 240 
ctacggcgca ttagctagtt ggtggggtaa cggctcacca aggcgacgat gcgtagccga 300 
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cctgagaggg 


tgatcggcca 


cactgggact 


gagacacggc 


ccagactcct acgggaggca 


360 


gcagtaggga 


atcttccgca 


atggacgcaa 


gtctgacgga 


gcaacgccgc gtgagtgatg 


420 


aaggttttcg 


gatcgtaaag 


ctctgttgcc 


agggaagaac 


gctatggaga gtaactgttc 


• 480 


cataggtgac 


ggtacctgag 


aagaaagccc 


cggctaacta 


cgtgccagca gccgcggtaa 


540 


tacgtagggg 


gcaagcgttg 


tccggaatta 


ttgggcgtaa 


agcgcgcgca ggcggtcatg 


600 


taagtctggt 


gtttaaaccc 


ggggctcaac 


tccgggtcgc 


atcggaaact gtgtgacttg 


660 


agtgcagaag 


aggaaagtgg 


aattccacgt 


gtagcggtga 


aatgcgtaga gatgtggagg 


720 


aacaccagtg 


gcgaaggcga 


ctttctgggc 


tgtaactgac 


gctgaggcgc gaaagcgtgg 


780 


ggagcaaaca 


ggattagata 


ccctggtagt 


ccacgccgta 


aacgatgaat gctaggtgtt 


840 


aggggtttcg 


atacccttgg 


tgccgaagtt 


aacacattaa 


gcattccgcc tggggagtac 


900 


ggtcgcaaga 


ctgaaactca 


aaggaattga 


cggggacccg 


cacaagcagt ggagtatgtg 


960 


gtttaattcg 


aagcaacgcg 


aagaacctta 


ccaggtcttg 


acatccctct gaccgtccta 


1020 


gagatagggc 


ttcccttcgg 


ggcagaggag 


acaggtggtg 


catggttgtc gtcagctcgt 


1080 


gtcgtgagat 


gttgggttaa 


gtcccgcaac 


gagcgcaacc 


cttaacttta gttgccagca 


1140 


ttaagttggg 


cactctagag 


tgactgccgg 


tgacaaaccg 


gaggaaggtg gggatgacgt 


1200 


caaatcatca 


tgccccttat 


gacctgggct 


acacacgtac 


tacaatggct ggtacaacgg 


1260 


gaagcgaagc 


cgcgaggtgg 


agcgaatcct 


aaaaagccag 


tctcagttcg gattgcaggc 


1320 


tgcaactcgc 


ctgcatgaag 


tcggaattgc 


tagtaatcgc 


ggatcagcat gcc'gcggtga 


1380 


atacgttccc 


gggtcttgta 


cacaccgccc 


gtcacaccac 


gagagtttac aacacccgaa 


1440 


gtcggtgggg 


taaccgcaag 


gagccagccg 


ccgaaggtgg 


ggtagatgat tggggtgaag 


1500 


tcgtaacaag 


gtagccgtat 


cggaaggtgc 


ggytggatca 


cctcctt 


1547 



<210> 17 

<211> 20 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> N-terminal amino acid sequence for the purified toxin from the 
broth fraction from IDAS1529. 

<400> 17 

Asp lie Thr Leu Lys Val Ala He Tyr Pro Tyr Val Pro Asp Pro Ser 
15 10 15 

Arg Phe Gin Ala 
20 
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<210> 18 

<211> 379 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Amino acid sequence of thiaminase I from Bacillus 
thiaminolyticus . 

<400> 18 

Ala His Ser Asp Ala Ser Ser Asp lie Thr Leu Lys Val Ala lie Tyr 
15 10 15 



Pro Tyr Val Pro Asp Pro Ala Arg Phe Gin Ala Ala Val Leu Asp Gin 
20 25 30 



Trp Gin Arg Gin Glu Pro Gly Val Lys Leu Glu Phe Thr Asp Trp Asp 
35 40 45 



Ser Tyr Ser Ala Asp Pro Pro Asp Asp Leu Asp Val Phe Val Leu Asp 
50 55 60 



Ser lie Phe Leu Ser His Phe Val Asp Ala Gly Tyr Leu Leu Pro Phe 
65 70 75 ~ 80 



Gly Ser Gin Asp He Asp Gin Ala Glu Asp Val Leu Pro Phe Ala Leu 
85 90 95 



Gin Gly Ala Lys Arg Asn Gly Glu Val Tyr Gly Leu Pro Gin He Leu 
100 105 110 



Cys Thr Asn Leu Leu Phe Tyr Arg Lys Gly Asp Leu Lys He Gly Gin 
115 120 125 



Val Asp Asn He Tyr Glu Leu Tyr Lys Lys He Gly Thr Ser His Ser 
130 135 140 



Glu Gin He Pro Pro Pro Gin Asn Lys Gly Leu Leu He Asn Met Ala 
145 150 155 160 



Gly Gly Thr Thr Lys Ala Ser Met Tyr Leu Glu Ala Leu He Asp Val 
165 170 175 



Thr Gly Gin Tyr Thr Glu Tyr Asp Leu Leu Pro Pro Leu Asp Pro Leu 
180 185 190 
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Asn Asp Lys Val He Arg Gly Leu Arg Leu Leu He Asn Met Ala Gly 
195 200 205 



Glu Lys Pro Ser Gin Tyr Val Pro Glu Asp Gly Asp Ala Tyr Val Arg 
210 215 220 



Ala Ser Trp Phe Ala Gin Gly Ser Gly Arg Ala Phe He Gly Tyr Ser 
225 230 235 240 



Glu Ser Met Met Arg Met Gly Asp Tyr Ala Glu Gin Val Arg Phe L"s 
245 250 255 



Pro He Ser Ser Ser Ala Gly Gin Asp He Pro Leu Phe Tyr Ser Asp 
260 265 270 



Val Val Ser Val Asn Ser Lys Thr Ala His Pro Glu Leu Ala Lys Lys 
275 280 285 



Leu Ala Asn Val Met Ala Ser Ala Asp Thr Val Glu Gin Ala Leu Arg 
290 295 300 



Pro Gin Ala Asp Gly Gin Tyr Pro Gin Tyr Leu Leu Pro Ala Arg His 
305 310 315 320 



Gin Val Tyr Glu Ala Leu Met Gin Asp Tyr Pro He Tyr Ser Glu Leu 
325 330 ~ 335 



Ala Gin He Val Asn Lys Pro Ser Asn Arg Val Phe Arg Leu Gly Pro 
340 345 350 



Glu Val Arg Thr Trp Leu Lys Asp Ala Lys Gin Val Leu Pro Glu Ala 
355 360 365 



Leu Gly Leu Thr Asp Val Ser Ser Leu Ala Ser 
370 375 



<210> 19 
<211> 953 
<212> PRT 

<213> Paenibacillus strain IDAS 1529 
<400> 19 , 

Met Lys Met lie Pro Trp Thr His His Tyr Leu Leu His Arg Leu Arg 
15 io 15 



Gly Glu Met Glu Val Lys Pro Met Asn Thr Thr Ser He Tyr Arg Gly 
20 25 30 
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Thr Pro Thr lie Ser Val Val Asp Asn Arg Asn Leu Glu He Arg lie 



35 40 



45 



Leu Gin Tyr Asn Arg He Ala Ala Glu Asp Pro Ala Asp Glu Cys lie 
50 55 GO 

Leu Arg Asn Thr Tyr Thr Pro Leu Ser Tyr Leu Gly Ser Ser Met Asp 
65 70 75 bo 

Pro Arg Leu Phe Ser Gin Tyr Gin Asp Asp Arg Gly Thr Pro Pro Asn 
85 90 95 

lie Arg Thr Met Ala Ser Leu Arg Gly Glu Ala Leu Cys Ser Glu Ser 
100 105 no 

Val Asp Ala Gly Arg Lys Ala Glu Leu Phe Asp He Glu Gly Arg Pro 
115 120 125 

Val Trp Leu He Asp Ala Asn Gly Thr Glu Thr Thr Leu Glu Tyr Asp 
130 "5 140 ^ P 

Val Leu Gly Arg Pro Thr Ala Val Phe Glu Gin Gin Glu Gly Thr Asp 
5 150 155 lso 

Ser Pro Gin Cys Arg Glu Arg Phe He Tyr Gly Glu Lys Glu Ala Asp 
l g 5 170 i7 5 

Ala Gin Ala Asn Asn Leu Arg Gly Gin Leu Val Arg His Tyr Asp Thr 
180 185 " i9 0 

Ala Gly Arg He Gin Thr Asp Ser He Ser Leu Ala Gly Leu Pro Leu 
195 200 205 

Arg Gin Ser Arg Gin Leu Leu Lys Asn Trp Asp Glu Pro Gly Asp Trp 
210 215 220 

Ser Met Asp Glu Glu Ser Ala Trp Ala Ser Leu Leu Ala Ala Glu Ala 
225 230 235 240 

Tyr Asp Thr Ser Trp Arg Tyr Asp Ala Gin Asp Arg Val Leu Ala Gin 
245 250 255 

Thr Asp Ala Lys Gly Asn Leu Gin Gin Leu Thr Tyr Asn Asp Ala Gly 
260 265 270 
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Gin Pro Gin Ala Val Ser Leu Lys Leu Gin Gly Gin Ala Glu Gin Arg 
275 280 285 



He Trp Asn Arg lie Glu Tyr Asn Ala Ala Gly Gin Val Asp Leu Ala 
290 ~ 295 300 



Glu Ala Gly Asn Gly He Val Thr Glu Tyr Thr Tyr Glu Glu Ser Thr 
305 310 315 320 



Gin Arg Leu He Arg Lys Lys Asp Ser Arg Gly Leu Ser Ser Gly Glu 
325 330 335 



Arg Glu Val Leu Gin Asp Tyr Arg Tyr Glu Tyr Asp Pro Val Gly Asn 
340 345 350 



He Leu Ser He Tyr Asn Glu Ala Glu Pro Val Arg Tyr Phe Arg Asn 
355 . 360 365 



Gin Ala Val Ala Pro Lys Arg Gin Tyr Ala Tyr Asp Ala Leu Tyr Gin 
370 375 380 



Leu Val Ser Ser Ser Gly Arg Glu Ser Asp Ala Leu Arg Gin Gin Thr 
385 390 395 400 



Ser Leu Pro Pro Leu He Thr Pro He Pro Leu Asp Asp Ser Gin Tyr 
405 410 415 



Val Asn Tyr Ala Glu Lys Tyr Ser Tyr Asp Gin Ala Gly Asn Leu He 
420 425 430 



Lys Leu Ser His Asn Gly Ala Ser Gin Tyr Thr Thr Asn Val Tyr Val 
435 440 445 



Asp Lys Ser Ser Asn Arg Gly He Trp Arg Gin Gly Glu Asp lie Pro 
450 455 460 



Asp He Ala Ala Ser Phe Asp Arg Ala Gly Asn Gin Gin Ala Leu Phe 
465 470 475 480 



Pro Gly Arg Pro Leu Glu Trp Asp Thr Arg Asn Gin Leu Ser Arg Val 
485 490 495 



His Met Val Val 
500 



Arg Glu Gly Gly Asp Asn Asp Trp Glu Gly Tyr Leu 
505 510 
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Tyr Asp Ser Ser Gly Met Arg He Val Lys Arg Ser Thr Arg Lys Thr 
515 520 ~ 525 



Gin Thr Thr Thr Gin Thr Asp Thr Thr Leu Tyr Leu Pro Gly Leu Glu 
530 535 540 



Leu Arg He Arg Gin Thr Gly Asp Arg Val Thr Glu Ala Leu Gin Val 
545 550 555 560 

He Thr Val Asp Glu Gly Ala Gly Gin Val Arg Val Leu His Trp Glu 
565 570 575 



Asp Gly Thr Glu Pro Gly Gly He Ala Asn Asp Gin Tyr Arg Tyr Ser 
580 585 590 



Leu Asn Asp His Leu Thr Ser Ser Leu Leu Glu Val Asp Gly Gin Gly 
595 600 605 



Gin He He Ser Lys Glu Glu Phe Tyr Pro Tyr Gly Gly Thr Ala Leu 
610 615 620 

Trp Thr Ala Arg Ser Glu Val Glu Ala Ser Tyr Lys Thr He Arg Tyr 
625 630 635 640 

Ser Gly Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly His Arg 
645 650 " 655 



Tyr Tyr Met Pro Trp Leu Gly Arg Trp Leu Asn Pro Asp Pro Ala Gly 
660 665 670 



Met Val Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn Pro He 
675 680 685 



Gly Leu Met Asp Pro Asn Gly Asn Ala Pro He Asn Val Ala Asp Tyr 
690 695 700 



Ser Phe Val His Gly Asp Leu Val Tyr Gly Leu Ser Lys Glu Arg Gly 
705 710 715 720 



Arg Tyr Leu Lys Leu Phe Asn Pro Asn Phe Asn Met Glu Lys Ser Asp 
725 730 735 



Ser Pro Ala Met Val He Asp Gin Tyr Asn Asn Asn Val Ala Leu Ser 
740 745 75 0 



He Thr Asn Gin Tyr Lys Val Glu Glu Leu Met Lys Phe Gin Lys Asp 



V 

WO 2004/002223 



PCT/US2003/020082 



72 

755 760 765 



Pro Gin Lys Ala Ala Arg Lys lie Lys Val Pro Glu Gly Asn Arg Leu 
770 775 780 



Ser Arg Asn Glu Asn Tyr Pro Leu Trp His Asp Tyr He Asn He Gly 
785 790 795 800 



Glu Ala Lys Ala Ala Phe Lys Ala Ser His He Phe Gin Glu Val Lys 
805 BlO 815 



Gly Asn Tyr Gly Lys Asp Tyr Tyr His Lys Leu Leu Leu Asp Arg Met 
820 825 830 



He Glu Ser Pro Leu Leu Trp Lys Arg Gly Ser Lys Leu Gly Leu Glu 
835 " 840 845 



lie Ala Ala Thr Asn Gin Arg Thr Lys He His Phe Val Leu Asp Asn 
850 855 860 



Leu Asn He Glu Gin Val Val Thr Lys Glu Gly Ser Gly Gly Gin Ser 
865 870 875 880 



lie Thr Ala Ser Glu Leu Arg Tyr He Tyr Arg Asn Arg Glu Arg Leu 
885 890 895 



Asn Gly Arg Val He Phe Tyr Arg Asn Asn Glu Arg Leu Asp Gin Ala 
900 905 910 



Pro Trp Gin Glu Asn Pro Asp Leu Trp Ser Lys Tyr Gin Pro Gly Leu 
915 920 925 



Arg Gin Ser Ser Ser Ser Arg Val Lys Glu Arg Gly He Gly Asn Phe 
930 935 940 



Phe Arg Arg Phe Ser Met Lys Arg Lys 



945 




950 


<210> 


20 




<211> 


4482 




<212> 


DNA 




<213> 


Xenorhabdus 


strain 


<400> 


20 





atgcagggtt caacaccttt gaaacttgaa ataccgtcat tgccctctgg gggcggatca 60 



ctaaaaggaa tgggagaagc actcaatgcc gtcggagcgg aagggggagc gtcattttca 120 
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ctgcccttgc cgatctctgt cgggcgtggt ctggtgccgg tgctatcact gaattacagc 180 

agtactgccg gcaatgggtc attcgggatg gggtggcaat gtggggttgg ttttatcagc 240 

ctgcgtaccg ccaagggcgt tccgcactat acgggacaag atgagtatct cgggccggat 300 

ggggaagtgt tgagtattgt gccggacagc caagggcaac cagagcaacg caccgcaacc 360 

tcactgttgg ggacggttct gacacagccg catactgtta cccgctatca gtcccgcgtg 420 

gcagaaaaaa tcgttcgttt agaacactgg cagccacagc agagacgtga ggaagagacg 480 

tctttttggg tactttttac tgcggatggt ttagtgcacc tattcggtaa gcatcaccat 540 

gcacgtattg ctgacccgca ggatgaaacc agaattgccc gctggctgat ggaggaaacc 600 

gtcacgcata ccggggaaca tatttactat cactatcggg cagaagacga tcttgactgt 660 

gatgagcatg aacttgctca gcattcaggt gttacggccc agcgttatct ggcaaaagtc 720 

agctatggca atactcagcc ggaaaccgct tttttcgcgg taaaatcagg tattcctgct 780 

gataatgact ggctgtttca tctggtattt gattacggtg agcgctcatc ttcgctgaac 840 

tctgtacccg aattcaatgt gtcagaaaac aatgtgtctg aaaacaatgt gcctgaaaaa 900 

tggcgttgtc gtccggacag tttctcccgc tatgaatatg ggtttgaaat tcgaacccgt 960 

cgcttgtgtc gccaagttct gatgtttcat cagctgaaag cgctggcagg ggaaaaggtt 1020 

gcagaagaaa caccggcgct ggtttcccgt cttattctgg attatgacct gaacaacaag 1080 

gtttccttgc tgcaaacggc ccgcagactg gcccatgaaa cggacggtac gccagtgatg 1140 

atgtccccgc tggaaatgga ttatcaacgt gttaatcatg gcgtgaatct gaactggcag 1200 

tccatgccgc agttagaaaa aatgaacacg ttgcagccat accaattggt tgatttatat 1260 

ggagaaggaa tttccggcgt actttatcag gatactcaga aagcctggtg gtaccgtgct 1320 

ccggtacggg atatcactgc cgaaggaacg aatgcggtta cctatgagga ggccaaacca 1380 

ctgccacata ttccggcaca acaggaaagc gcgatgttgt tggacatcaa tggtgacggg 144 o 

cgtctggatt gggtgattac ggcatcaggg ttacggggct accacaccat gtcaccggaa 1500 

ggtgaatgga caccctttat tccattatcc gctgtgccaa tggaatattt ccatccgcag 1560 

gcaaaactgg ctgatattga tggggctggg ctgcctgact tagcgcttat cgggccaaat 1620 

agtgtacgtg tctggtcaaa taatcgggca ggatgggatc gcgctcagga tgtgattcat 1680 

ttgtcagata tgccactgcc ggttcccggc agaaatgagc gtcatcttgt cgcattcagt 1740 

gatatgacag. gctccgggca atcacatctg gtggaagtaa cggcagatag cgtgcgctac 1800 

tggccgaacc tggggcatgg aaaatttggt gagcctctga tgatgacagg cttccagatt 1860 

agcggggaaa cgtttaaccc cgacagactg tatatggtag acatagatgg ctcaggcacc 1920 

accgatttta tttatgcccg caatacttac cttgaactct atgccaatga aagcggcaat 1980 
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cattttgctg aacctcagcg tattgatctg ccggatgggg tacgttttga tgatacttgt 2040 

cggttacaaa tagcggatac acaaggatta gggactgcca gcattatttt gacgatcccc 2100 

catatgaagg tgcagcactg gcgattggat atgaccatat tcaagccttg gctgctgaat 2160 

gccgtcaata acaatatggg aacagaaaco acgctgtatt atcgcagctc tgcccagttc 2220 

tggctggatg agaaattaca ggcttctgaa tccgggatga cggtggtcag ctacttaccg 2280 

ttcccggtgc atgtgttgtg gcgcacggaa gtgctggatg aaatttccgg taaccgattg 2340 

accagccatt atcattactc acatggtgcc tgggatggtc tggaacggga gtttcgtggt 24 00 

tttgggcggg tgacacaaac tgatattgat tcacgggcga gtgcgacaca ggggacacat 2460 

gctgaaccac cggcaccttc gcgcacggtt aattggtacg gcactggcgt acgggaagto 2520 

gatattcttc tgcccacgga atattggcag ggggatcaac aggcatttcc ccattttacc 2580 

ccacgcttta cccgttatga cgaaaaatcc ggtggtgata tgacggtcac gccgagcgaa 2640 

caggaagaat actggttaca tcgagcctta aaaggacaac gtttacgcag tgagctgtat 2700 

ggggatgatg attctatact ggccggtacg ccttattcag tggatgaatc ccgcacccaa 2760 

gtacgtttgt taccggtgat ggtatcggac gtgcctgcgg tactggtttc ggtggccgaa 2820 

tcccgccaat accgatatga acgggttgct accgatccac agtgcagcca aaagatcgtc 2880 

cttaaatctg atgcgttagg atttccgcag gacaatcttg agattgccta ttcgagacgt 2940 

ccacagcctg agttctcgcc ttatccggat accctgcccg aaacactttt caccagcagt 3000 

ttcgacgaac agcagatgtt ccttcgtctg acacgccagc gttcttctta tcatcatctg 3060 

aatcatgatg ataatacgtg gatcacaggg cttatggata cctcacgoag tgacgcacgt 3120 

atttatcaag ccgataaagt gccggacggt ggattttccc ttgaatggtt ttctgccaca 3180 

ggtgcaggag cattgttgtt gcctgatgoc gcagccgatt atctgggaca tcagcgtgta 3240 

gcatataccg gtccagaaga acaacccgct attoctccgc tggtggcata cattgaaacc 3300 

gcagagtttg atgaacgatc gttggcggct tttgaggagg tgatggatga gcaggagctg 3360 

acaaaacagc tgaatgatgc gggctggaat acggcaaaag tgccgttcag tgaaaagaca 3420 

gatttccatg tctgggtggg acaaaaggaa tttacagaat atgccggtgc agacggattc 3480 
tatcggccat tggtgcaacg ggaaaccaag cttacaggta aaacgacagt cacgtgggat 
agccattact gtgttatcac cgcaacagag gatgcggctg gcctgcgtat gcaagcgcat 

tacgattatc gatttatggt tgcggataac accacagatg tcaatgataa ctatcacacc 3660 

gtgacgtttg atgcactggg gagggtaacc agcttccgtt tctgggggac tgaaaacggt 3720 
gaaaaacaag gatatacccc tgcggaaaat gaaactgtcc cctttattgt coccacaacg 



3540 
3600 



3780 
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gtggatgatg ctctggcatt gaaacccggt atacctgttg cagggctgat ggtttatgcc 3840 

cctctgagct ggatggttca ggccagcttt tctaatgatg gggagcttta tggagagctg 3900 

aaaccggctg ggatcatcac tgaagatggt tatctcctgt cgcttgcttt tcgccgctgg 3960 

caacaaaata accctgccgc tgccatgcca aagcaagtca attcacagaa cccaccccat 4020 

gtactgagtg tgatcaccga ccgctatgat gccgatccgg aacaacaatt acgtcaaacg 4080 

tttacgttta gtgatggttt tgggcgaacc ttacaaacag ccgtacgcca tgaaagtggt 4140 

gaagcctggg tacgtgatga gtatggagcc attgtggctg aaaatcatgg cgcgcctgaa 4200 

acggcgatga cagatttccg ttgggcagtt tccggacgta cagaatatga cggaaaaggc 4260 

caagccctgc gtaagtatca accgtatttc ctgaatagtt ggcagtacgt cagtgatgac 4320 

agtgcccggc aggatatata tgccgatacc cattactatg atccgttggg gcgtgaatat 4380 

caggttatca cggccaaagg cgggtttcgt cgatccttat tcactccctg gtttgtggtg 4440 

aatgaagatg aaaatgacac tgccggtgaa atgacagcat aa 44 b 2 

<210> 21 
<211> 3051 
<212> DNA 

<213> Xenorhabdus strain Xwi 
<400> 21 

atgaagaatt tcgttcacag caatacgcca tccgtcaccg tactggacaa ccgtggtcag 60 

acagtacgcg aaatagcctg gtatcggcac cccgatacac ctcaggtaac cgatgaacgc 12 0 

atcaccggtt atcaatatga tgctcaagga tctctgactc agagtattga tccgcgattt 180 

tatgaacgcc agcagacagc gagtgacaag aacgccatta cacccaatct tattctcttg 240 

tcatcactca gtaagaaggc attgcgtacg caaagtgtgg atgccggaac ccgtgtcgcc 300 

ctgcatgatg ttgccgggcg tcccgtttta gctgtcagcg ccaatggcgt tagccgaacg 360 

tttcagtatg aaagtgataa ccttccggga cgattgctaa cgattaccga gcaggtaaaa 420 

ggagagaacg cctgtatcac ggagcgattg atctggtcag gaaatacgcc ggcagaaaaa 480 

ggcaataatc tggccggcca gtgcgtggtc cattatgatc ccaccggaat gaatcaaacc 540 

aacagcatat cgttaaccag catacccttg tccatcacac agcaattact gaaagatgac 600 

agcgaagccg attggcacgg tatggatgaa tctggctgga aaaacgcgct ggcgccggaa 660 

agcttcactt ctgtcagcac aacggatgct accggcacgg tattaacgag tacagatgct 720 

gccggaaaca agcaacgtat cgcctatgat gtggccggtc tgcttcaagg cagttggttg 780 

gcgctgaagg ggaaacaaga acaagttatc gtgaaatccc tgacctattc ggctgccagc 84 0 

cagaagctac gggaggaaca tggtaacggg atagtgacta catataccta tgaacccgag 900 
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acgcaacgag ttattggcat aaaaacagaa cgtccttccg gtcatgccgc tggggagaaa 960 

attttacaaa acctgcgtta tgaatatgat cctgtcggaa atgtgctgaa atcaactaat 1020 

gatgctgaaa ttacccgctt ttggcgcaac cagaaaattg taccggaaaa tacttacacc 1080 

tatgacagcc tgtaccagct ggtttccgtc actgggcgtg aaatggcgaa tattggccga 114 0 

caaaaaaacc agttacccat ccccgctctg attgataaca atacttatac gaattactct 1200 

cgcacttacg actatgatcg tgggggaaat ctgaccagaa ttcgccataa ttcaccgatc 1260 

accggtaata actatacaac gaacatgacc gtttcagatc acagcaaccg ggctgtactg 132 0 

gaagagctgg cgcaagatcc cactcaggtg gatatgttgt tcacccccgg cgggcatcag 1380 

acccggcttg ttcccggtca ggatcttttc tggacacccc gtgacgaatt gcaacaagtg 144 0 

atattggtca atagggaaaa tacgacgcct gatcaggaat tctaccgtta tgatgcagac 1500 

agtcagcgtg tcattaagac tcatattcag aagacaggta acagtgagca aatacagcga 1560 

acattatatt tgccagagct ggaatggcgc acgacatata gcggcaatac attaaaagag 1620 

tttttgcagg tcatcactgt cggtgaatcg ggtcaggcac aagtgcgggt gctgcattgg 1680 

gaaacaggca aaccggcgga tatcagcaat gatcagctgc gctacagtta tggcaacctg 1740 

attggcagta gcgggctgga attggacagt gacgggcaga tcattagtca ggaagaatat 1800 

tacccctatg ggggaaccgc cgtgtgggca gcccgaagtc agtcagaagc tgattacaaa 1860 

accgtgcgtt attctggcaa agagcgggat gcaacagggt tgtattacta cggttatcgt 1920 

tattatcaat cgtggacagg gcgatggttg agtgtagatc ctgccggtga ggtcgatggt 1980 

ctcaatttgt tccgaatgtg caggaataac cccatcgttt tttctgattc tgatggtcgt 2040 

ttccccggtc agggtgtcct tgcctggata gggaaaaaag cgtatcgaaa ggcagtcaac 2100 

atcacgacag aacacctgct tgaacaaggc gcttcctttg atacgttctt gaaattaaac 2160 

cgaggattgc gaacgtttgt tttgggtgtg ggggtagcaa gtctgggggt gaaggcggcc 2220 

acgattgcag gagcgtcgcc ttgggggatt gtcggggctg ccattggtgg ttttgtetec 2280 

ggggcggtga tggggttttt cgcgaacaac atctcagaaa aaattgggga agttttaagt 2340 
tatctgacgc gtaaacgttc tgttcctgtt caggttggcg cttttgttgt cacatcgctt 2400 
gtgacgtctg cactatttaa cagctcttcg acaggtaccg ccatttccgc agcaacagcg 2460 
gtcaccgttg gaggattaat ggctttagcc ggagagcata acacgggcat ggctatcagt 2520 
attgccacac ccgccggaca aggtacgctg gatacgctca ggcccggtaa tgtcagcgcg 2580 
ccagagcggt taggggcact atcaggcgca attattggcg gcatattact tggccgccat 2640 
cagggaagtt ctgagctggg tgaacgggca gcgattggtg ctatgtatgg tgctcgatgg 2700 
ggaaggatca ttggtaatct atgggatggc ccttatcggt ttatcggcag gttactgctc 2760 
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agaagaggca ttagctctgc catttcccac gctgtcagtt ccaggagctg gtttggccga 2820 

atgataggag aaagtgtcgg gagaaatatt tctgaagtat tattacctta tagccgtaca 2880 

cccggtgaat gggttggtgc agccattggc gggacagccg cggccgctca tcatgccgtt 2940 

ggaggggaag ttgccaatgc cgctagccgg gttacctgga gcggctttaa gcgggctttt 3000 



<210> 22 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer SB101 

<400> 22 

gckatggcsg acccgatgca wtacaagctg gc 32 

<210> 23 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer SB102 



<210> 24 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer SB103 

<400> 24 

tgtataactg gatggcyggw cgtctstc 28 

<210> 25 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer SB104 



aataacttct 



tctttaacgc ctctgcacgt cataatgaat ccgaagcata a 



3051 



<400> 23 

agcggytgac crtccagrct carattgtgg eg 



32 



<400> 25 

tcraaaggca graamegget gtcgtt 



26 



<210> 26 
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<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer SB105 
<400> 26 

cttcyctkga tatcytkytg gatgtgct 28 



<210> 27 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer SB106 

<400> 27 

acgrctggya ttggyaatca gccartccaa 30 



<210> 28 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer SB212 



<220> 

<221> misc_feature 

<222> (7).. (7) 

<223> n = i (inosine) 

<400> 28 

cgytatnaat atgaycckgt vggyaat 27 



<210> 29 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer SB213 



<220> 

<221> mis cofeature 

<222> (17).. (17) 

<223> n a i (inosine) 

<400> 29 

catcbcgytc tttrccngar tarcg 25 



<210> 30 
<211> 33 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer SB215 



<220> 

<221> misc_feature 

<222> (10).. (10) 

<223> n = i (inosine) 

<400> 30 

cghagctcyn cccagtwytg gctggatgar aaa 33 

<210> 31 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer SB217 



<220> 

<221> misc_feature 

<222> (24).. (24) 

<223> n = i (inosine) 

<400> 31 

gtrtcatttt catcttcrtt bacnryaaac ca 32 

<210> 32 
<211> 1293 
<212> DNA 

<213> Paenibacillus apairus strain DB482 
<400> 32 

gcagcccgaa ggctccggca cgctggcatc cttgaaggat acctaccatc cgatgaccct 60 

tccctatgat gacgaccttg cgcaaatcaa tgccgtggcg gaggcgcact catctaattt 12 0 

gctggggatt tgggataccc tgctggacac gcagcggact tccatcctgc agaattccgc 180 

cgctgcctgc cggataagca aggcgcggca atcggcatcc ccggatcaga gagcctccga 24 0 

tgatgagccg gtattgatta caggagaaga attctacctg gagacgggcg gcaaacggct 300 

ttttctggcg cataaactcg agataggctc cacgataagc gccaaaatca acattggacc 360 

gccgcaagcg gccgatatcg cgccagcaaa gttgcaactc gtttattacg gcagaggcgg 420 

cagaggggac tacttccttc gtgtggcaga cgatgtgtcc ctcggtggaa aattgctgaa 480 

caattgttat ctgaccagcg acgacggaca gagcaacaat attaacggac cattctgcct 540 

aatgattaat cgaggcaccg gcagcatgcc cagcgggact cacctgccag ttcagattga 600 

cagagtgaca gatacatccc tacgcatttt tgtgccgcaa cacggttact tgggactagg 660 
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agaaagcctt 
gttgaccttt 
gatgccgcct 
gccaaacagc 
ctacaacgtt 
tgtcgatgat 
gcagaaggat 
cggggagcat 
tccgttgtgg 
ctttacggcg 
gcgaagcacg 



gccagcaact 
accctaagaa 
gttgccgaca 
ttccgtctgc 
aagactgcca 
ttctgcatga 
tatcagtcaa 
gttccggttt 
gcaaaacagt 
gataatgttg 
ggtctttcct 



ggaatgaacc 
agaatgagtc 
cgaccccgtc 
tggttaaccc 
taacccgagc 
agaccggctt 
aaagcagtga 
caacctatgg 
ataacagcgc 
cagctttggc 
ttgagcagtt 



80 

gttggcgctg 
cggacaagat 
cccgccgacg 
cgagccgaca 
tcctgccgat 
gagctttgat 
gtacaaaagc 
agctgtgttt 
aggcgctgca 
aggaagagcg 

gga 



aatctggact 
accatttcca 
agggaaacgc 
gaagaagaca 
ctggccgccg 
gaattgctga 
cgatttgtaa 
ttgacaggta 
acagacaccc 
gaaaagcttg 



tggatcaagc 
taatcgatat 
tttccttgac 
tcgccaagca 
ccttaaatgt 
acttaacgat 
aatttggcgg 
cggaagaaac 
ctgttttgaa 
tgcggctggc 



72 0 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1293 



<210> 33 

<211> 430 

<212> PRT 

<213> Paenibacillus apairius strain DB482 

<400> 33 

Gin Pro Glu Gly Ser Gly Thr Leu Ala Ser Leu Lys Asp Thr Tyr His 
1 5 10 15 



Pro Met Thr Leu Pro Tyr Asp Asp Asp Leu Ala Gin lie Asn Ala Val 
20 ~ 25 30 



Ala Glu Ala His Ser Ser Asn Leu Leu Gly lie Trp Asp Thr Leu Leu 
35 40 45 



Asp Thr Gin Arg Thr Ser lie Leu Gin Asn Ser Ala Ala Ala Cys Arg 
50 55 60 



He Ser Lys Ala Arg Gin Ser Ala Ser Pro Asp Gin Arg Ala Ser Asp 
65 70 75 80 



Asp Glu Pro Val Leu He Thr Gly Glu Glu Phe Tyr Leu Glu Thr Gly 
85 90 95 



Gly Lys Arg Leu Phe Leu Ala His Lys Leu Glu He Gly Ser Thr He 
100 105 HO 



Ser Ala Lys He 
115 



Asn He Gly Pro Pro 
120 



Gin Ala Ala Asp He Ala Pro 
125 



WO 2004/002223 



PCT/US2003/020082 



81 



Ala Lys Leu Gin Leu Val Tyr Tyr Gly Arg Gly Gly Arg Gly Asp Tyr 
130 135 " 140 



Phe Leu Arg Val Ala Asp Asp Val Ser Leu Gly Gly Lys Leu Leu Asn 
145 150 155 160 



Asn Cys Tyr Leu Thr Ser Asp Asp Gly Gin Ser Asn Asn lie Asn Gly 
165 170 175 



Pro Phe Cys Leu Met He Asn Arg Gly Thr Gly Ser Met Pro Ser Gly 
180 185 190 



Thr His Leu Pro Val Gin He Asp Arg Val Thr Asp Thr Ser Leu Arg 
195 200 205 



He Phe Val Pro Gin His Gly Tyr Leu Gly Leu Gly Glu Ser Leu Ala 
210 215 220 



Ser Asn Trp Asn Glu Pro Leu Ala Leu Asn Leu Asp Leu Asp Gin Ala 
225 230 235 240 



Leu Thr Phe Thr Leu Arg Lys Asn Glu Ser Gly Gin Asp Thr He Ser 
245 250 255 



He He Asp Met Met Pro Pro Val Ala Asp Thr Thr Pro Ser Pro Pro 
260 265 270 



Thr Arg Glu Thr Leu Ser Leu Thr Pro Asn Ser Phe Arg Leu Leu Val 
275 280 285 



Asn Pro Glu Pro Thr Glu Glu Asp He Ala Lys His Tyr Asn Val Lys 
290 295 300 



Thr Ala He Thr Arg Ala Pro Ala Asp Leu Ala Ala Ala Leu Asn Val 
305 310 315 320 



Val Asp Asp Phe Cys Met Lys Thr Gly Leu Ser Phe Asp Glu Leu Leu 
325 330 335 



Asn Leu Thr Met Gin Lys Asp Tyr Gin Ser Lys Ser Ser Glu Tyr Lys 
340 345 350 



Ser Arg Phe Val Lys Phe Gly Gly Gly Glu His Val Pro Val Ser Thr 
355 360 365 



V 
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Tyr Gly Ala Val Phe Leu Thr Gly Thr Glu Glu 
370 375 



Thr Pro Leu Trp Ala 
380 



Lys Gin Tyr Asn Ser Ala Gly Ala Ala Thr Asp 
385 * 390 395 



Thr Pro Val Leu Asn 
400 



Phe Thr Ala Asp Asn Val Ala Ala Leu Ala Gly 
405 410 



Arg Ala Glu Lys Leu 
415 



Val Arg Leu Ala Arg Ser Thr Gly Leu Ser Phe 
420 425 



Glu Gin Leu 
430 



<210> 34 
<211> 340 
<212> DNA 

<213> Paenibacillus apairius strain DB482 
<400> 34 

tatatttatt cacaccttgg acttcctgat caaccgcggc gacagcttgt accggctgct 60 
ggagcgggat actctgaccg aagccaagat gtattacatc caggccagcc aactgcttgg 120 
tccccgcccc gatatccgga tcaatcacag ttggcctaat ccgaccctgc aaagcgaagc 180 
ggacgcggtg accgccgtac cgacgcgaag cgattcgcgg gcaacgccaa tcctcgcctt 240 
gcgagcgctt ctgaaagcgg aaaacgggca tttcctgccg ccttataatg atgaactgtt 300 
agctttctgg gataaaatcg atctgcgttt atacaattta 340 

<210> 35 
<211> 565 
<212> DNA 

<213> Paenibacillus apairius strain DB482 
<400> 35 

gtctctatac tatcaaatgt atgacgccgc attgccgctc tgcttgatgg ccaaacaggc 60 

tttagagaaa gaaatcggca ctgataaaac gggtggagtt ttcaccctcc cggcctggaa 120 

tgatctgtat cagggattac tggcggggga ggcgctgctg ctcgagcttc agaagctgga 180 

gaatctgtgg ctggaggaag acaagcgcgg aatggaagcc gtaaaaacgg tatctttaga 240 

tacccttctc cgcaaagaaa cgccagagtc tagcttcgta gagctagtca aggaagttct 300 

ggacggaaag acgcctgacc ctgtaggcgg agtcggcgta cagctgcaaa acaatatttt 360 

cagcgcaacc cttgacctgt ccgttcttgg cttggatcgc tcttacaacc aagcggaaaa 420 

gacccgcagg atcaaaaatc tgtcggttac cttacccgcg cttttgggac cttaccagga 480 

tatagaagca accttatcgc taggcggcga gaccgttgcg ctttcccatg gcgtggatga 540 

cagcggcttg ttcatcacgg atctt 565 



WO 2004/002223 



PCT/US2003/020082 



83 



<210> 36 

<211> 113 

<212> PRT 

<213> Paenibacillus apairius strain DB482 

<400> 36 

He Phe He His Thr Leu Asp Phe Leu He Asn Arg Gly Asp Ser Leu 
1 5 10 15 

Tyr Arg Leu Leu Glu Arg Asp Thr Leu Thr Glu Ala Lys Met Tyr Tyr 
20 25 30 

He Gin Ala Ser Gin Leu Leu Gly Pro Arg Pro Asp He Arg He Asn 
35 40 45 

His Ser Trp Pro Asn Pro Thr Leu Gin Ser Glu Ala Asp Ala Val Thr 
50 55 S0 

Ala Val Pro Thr Arg Ser Asp Ser Arg Ala Thr Pro He Leu Ala Leu 
65 70 75 so 

Arg Ala Leu Leu Lys Ala Glu Asn Gly His Phe Leu Pro Pro Tyr Asn 
85 90 95 

Asp Glu Leu Leu Ala Phe Trp Asp Lys He Asp Leu Arg Leu Tyr Asn 
100 105 no 

Leu 



<210> 37 

<211> 188 

<212> PRT 

<213> Paenibacillus apairius strain DB482 

<400> 37 

Ser Leu Tyr Tyr Gin Met Tyr Asp Ala Ala Leu Pro Leu Cys Leu Met 
1 5 10 is' 

Ala Lys Gin Ala Leu Glu Lys Glu He Gly Thr Asp Lys Thr Gly Gly 
20 25 30 

Val Phe Thr Leu Pro Ala Trp Asn Asp Leu Tyr Gin Gly Leu Leu Ala 
35 40 45 



Gly Glu Ala Leu Leu Leu Glu Leu Gin Lys Leu Glu Asn Leu Trp Leu 
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50 55 60 

Glu Glu Asp Lys Arg Gly Met Glu Ala Val Lys Thr Val Ser Leu Asp 
65 70 75 80 

Thr Leu Leu Arg Lys Glu Thr Pro Glu Ser Ser Phe Val Glu Leu Val 
85 90 95 

Lys Glu Val Leu Asp Gly Lys Thr Pro Asp Pro Val Gly Gly Val Gly 
100 105 110 

Val Gin Leu Gin Asn Asn He Phe Ser Ala Thr Leu Asp Leu Ser Val 
115 120 125 

Leu Gly Leu Asp Arg Ser Tyr Asn Gin Ala Glu Lys Thr Arg Arg He 
130 135 140 

Lys Asn Leu Ser Val Thr Leu Pro Ala Leu Leu Gly Pro Tyr Gin Asp 
145 150 155 160 

He Glu Ala Thr Leu Ser Leu Gly Gly Glu Thr Val Ala Leu Ser His 
165 170 175 

Gly Val Asp Asp Ser Gly Leu Phe He Thr Asp Leu 



<210> 38 
<211> 2091 
<212> DNA 

<213> Paenibacillus apairius strain DB482 
<400> 38 

caggcaacct catcggctgt ctgtggtgtg ccattcccga tcaacgtggt atcggacata 60 

cacacggtgg acgaaatcag cggcagcgcc aggattcaga agtatactta ccgcaatggc 120 

gtgtatgacc ggaccgataa ggaatttgcc gggttcggcc acattgacac atgggaagag 180 

gagcgggatt ccgagggaac ccttagcatc agcactcccc ccgtgctgac acggacctgg 240 

tatcataccg ggcaaaagca ggatgaggag cgtgccgtgc agcaatattg gcaaggcgac 300 

cctgccgctt ttcaggttaa acccgtccgg cttactcgat tcgatgcggc aacggcccag 360 

gatgtcccgc tagactctcc caataggcgg gaagagtatt ggctgtatcg ctcgttgcga 420 

gggatgccgc tgcgtaatga aatttttgct ggagatgttg tggggttgcc tccttatcag 480 

gtggagagct tacgttatca agtgcgcttg atgcagagca ccgattcgga atgtgttaca 540 

ttgcccatgc agttggagca gcttacgtac aactatgagc aaatcgcctc tgatccgcag 600 
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tgttcacagc agatacagca atggttcgac gaatacggcg tggcggcaca gagtataacg 660- 

atccaafcatc cgcgccgggc acagccggag gacaatccgt accctcacac gctgccggat 720 

accagctgga gcagcagtta tgattcgcag caaatgctgc tgcggttaac aaggcaaagg 780 

caaaaagcgt accaccttgc agaccctgaa ggctggcgct tgaatatccc ccatcagaca 840 

cgcctggatt ctt teat eta ttctgetgae agcgtgcctg ccgaaggaat aagegcagag 900 

ctgctggggg gtgaeggcac gttacgatct ccggcgctgg aacaggctta tggcggccag 960 

tcagagatca tetatgeggg egggggggaa ccggattcgc gagctctggt ccattacacc 1020 

agaagegega ttctcgatga agcctgtttg caagectatg aaggegtact gagegatage 1080 

caattgaact cgcttcttgc atcttccggc tatcaacgaa gcgcaagaat attgggttcc 114 0 

ggcgatgaag eggatatttt tgttgcggaa caaggattta cccgttatgc ggatgaacag 1200 

aattttttcc gtattctggg acaacaatcc tctctcttga ceggggaaca agtattaaca 1260 

tgggatgata atttctgtgc ggtaacatcc atagaagacg cgcttggcaa tcaaattcag 1320 

attgeatatg attaccgett tgtggaggct atccagatta ccgatgcgaa taacaatgtg 1380 

aatcaggtct ccctggatgc tctcggccgg gtegtataca gccggacctg gggcacagag 1440 

gaagggatag agaegggett ccgcccggag geggaattet cgccgcccga gacaatggag 1500 

caggegcttg ccctggcgtc tcccttgccg gttgeatect gctgtgttta tgatgegcat 1560 

agctggatgg gaacgataac tcttgggcag ctgtcagcgc ttgttccaga tagtgaaaag 1620 

caatggtcgt tcttgatagc caategcttg attatgeegg aeggcaggat aagagcccgc 1680 

ggccgggccc catggtggct teaaeggcta ttgccgcctg ccgtggccaa getgetgage 1740 

gaggeggace gtaagccgcc geataeggta gttttggcag cagategcta cccggatgac 1800 

ccatcccagc aaattcaggc cagegtegtg tttagcgatg gctttgggcg tacgatacaa i860 

acegctaaaa gagcagatac ccgatgggcg attaeggaac ggattgacta tgacgaaacc 1920 

ggagccgtaa tccgaagctt teagectttt tatattgatg actggaatta tgtgggcaaa 1980 

gaggctgtca geggctctat gtatgcaacg atctattact atgatgetet ggcacgccaa 2040 

ctaaggatgg tcaacgccaa aggatatgag aggagaactg ctttttaccc a 2091 

<210> 39 
<211> 697 
<212> PRT 

<213> Paenibacillus apairius strain DB482 
<400> 39 

Gin Ala Thr Ser Ser Ala Val Cys Gly Val Pro Phe Pro He Asn Val 
1 5 10 
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Val Ser Asp lie His Thr Val Asp Glu lie Ser Gly Ser Ala Arg He 
20 25 30 



Gin Lys Tyr Thr Tyr Arg Asn Gly Val Tyr Asp Arg Thr Asp Lys Glu 
35 40 45 



Phe Ala Gly Phe Gly His He Asp Thr Trp Glu Glu Glu Arg Asp Ser 
50 55 60 



Glu Gly Thr Leu Ser He Ser Thr Pro Pro Val Leu Thr Arg Thr Trp 
65 70 75 80 



Tyr His Thr Gly Gin Lys Gin Asp Glu Glu Arg Ala Val Gin Gin Tyr 
85 90 95 



Trp Gin Gly Asp Pro Ala Ala Phe Gin Val Lys Pro Val Arg Leu Thr 
100 105 HO 



Arg Phe Asp Ala Ala Thr Ala Gin Asp Val Pro Leu Asp Ser Pro Asn 
115 12 0 125 



Arg Arg Glu Glu Tyr Trp Leu Tyr Arg Ser Leu Arg Gly Met Pro Leu 
130 135 140 



Arg Asn Glu He Phe Ala Gly Asp Val Val Gly Leu Pro Pro Tyr Gin 
145 150 155 160 



Val Glu Ser Leu Arg Tyr Gin Val Arg Leu Met Gin Ser Thr Asp Ser 
165 170 175 



Glu Cys Val Thr Leu Pro Met Gin Leu Glu Gin Leu Thr Tyr Asn Tyr 
180 185 190 



Glu Gin He Ala Ser Asp Pro Gin Cys Ser Gin Gin He Gin Gin Trp 
195 200 205 



Phe Asp Glu Tyr Gly Val Ala Ala Gin Ser He Thr He Gin Tyr Pro 
210 ' 215 220 



Arg Arg Ala Gin Pro Glu Asp Asn Pro Tyr Pro His Thr Leu Pro Asp 
225 230 235 240 



Thr Ser Trp Ser Ser Ser Tyr Asp Ser Gin Gin Met Leu Leu Arg Leu 
245 250 255 
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Thr Arg Gin Arg Gin Lys Ala Tyr His Leu Ala Asp Pro Glu Gly Trp 
260 265 270 



Arg Leu Asn He Pro His Gin Thr Arg Leu Asp Ser Phe He Tyr Ser 
275 280 285 



Ala Asp Ser Val Pro Ala Glu Gly He Ser Ala Glu Leu Leu Gly Gly 
290 295 300 



Asp Gly Thr Leu Arg Ser Pro Ala Leu Glu Gin Ala Tyr Gly Gly Gin 
305 310 315 320 

Ser Glu He He Tyr Ala Gly Gly Gly Glu Pro Asp Ser Arg Ala Leu 
325 330 335 

Val His Tyr Thr Arg Ser Ala He Leu Asp Glu Ala Cys Leu Gin Ala 
340 345 350 



Tyr Glu Gly Val Leu Ser Asp Ser Gin Leu Asn Ser Leu Leu Ala Ser 
355 360 365 



Ser Gly Tyr Gin Arg Ser Ala Arg He Leu Gly Ser Gly Asp Glu Ala 
370 375 380 



Asp He Phe Val Ala Glu Gin Gly Phe Thr Arg Tyr Ala Asp Glu Gin 
385 390 395 " 400 

Asn Phe Phe Arg He Leu Gly Gin Gin Ser Ser Leu Leu Thr Gly Glu 
405 410 415 

Gin Val Leu Thr Trp Asp Asp Asn Phe Cys Ala Val Thr Ser He Glu 
420 425 430 



Asp Ala Leu Gly Asn Gin He Gin He Ala Tyr Asp Tyr Arg Phe Val 
435 440 445 



Glu Ala He Gin He Thr Asp Ala Asn Asn Asn Val Asn Gin Val Ser 
450 455 460 



Leu Asp Ala Leu Gly Arg Val Val Tyr Ser Arg Thr Trp Gly Thr Glu 
465 470 475 " 480 



Glu Gly He Glu Thr Gly Phe Arg Pro Glu Ala Glu Phe Ser Pro Pro 
485 490 495 



Glu Thr Met Glu Gin Ala Leu Ala Leu Ala Ser Pro Leu Pro Val Ala 
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500 505 510 



Ser Cys Cys Val Tyr Asp Ala His Ser Trp Met Gly Thr He Thr Leu 
515 520 525 



Gly Gin Leu Ser Ala Leu Val Pro Asp Ser Glu Lys Gin Trp Ser Phe 
530 535 540 



Leu He Ala Asn Arg Leu He Met Pro Asp Gly Arg He Arg Ala Arg 
545 550 555 560 



Gly Arg Ala Pro Trp Trp Leu Gin Arg Leu Leu Pro Pro Ala Val Ala 
565 570 575 



Lys Leu Leu Ser Glu Ala Asp Arg Lys Pro Pro His Thr Val Val Leu 
580 585 590 



Ala Ala Asp Arg Tyr Pro Asp Asp Pro Ser Gin Gin He Gin Ala Ser 
595 600 605 



Val Val Phe Ser Asp Gly Phe Gly Arg Thr He Gin Thr Ala Lys Arg 
610 615 620 



Ala Asp Thr Arg Trp Ala He Thr Glu Arg He Asp Tyr Asp Glu Thr 
625 630 635 640 



Gly Ala Val He Arg Ser Phe Gin Pro Phe Tyr He Asp Asp Trp Asn 
645 650 655 



Tyr Val Gly Lys Glu Ala Val Ser Gly Ser Met Tyr Ala Thr He Tyr 
660 665 670 



Tyr Tyr Asp Ala Leu Ala Arg Gin Leu Arg Met Val Asn Ala Lys Gly 
675 680 685 



Tyr Glu Arg Arg Thr Ala Phe Tyr Pro 
690 "~ 695 



<210> 40 
<211> 858 
<212> DNA 

<213> Paenihacillus apairius strain DB482 
<400> 40 

atcctgtcta tctgcaatga agcggagccg gtccgttatt tccgcaatca ggccgtcgct 60 



ccgaaaaggc agtatgctta cgatgccctg tatcagcttg tatccagctc ggggcgggaa 12 0 
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89 

ttcgtcagca gacgtcgctt cctcccttga tcacgcctat tcctctcgac 


180 


gatagccaat 


acgtcaatta tgctgagaga tacagctatg atcgggcggg caatctaatc 


240 


aagcttagcc 


atcatggggc aagtcaatat acaacgaatg tgcatgtgga caaaagttca 


300 


aaccggggga 


tttggcggca aggggaagac atcccggata tcgcggcttc ctttgacaga 


360 


gcaggcaatc 


aacaagattt attcccgggg agacggttgg aatgggatac acgcaatcag 


420 


ttatgccgtg 


tccatatggt cgtgcgcgaa ggcggcgata acgactggga gggctatctc 


480 


tatgacagct 


caggaatgcg catcgtaaaa cattctaccc gcaagacaca gacgacaacg 


540 


caaacggata 


cgacgatcta tttgccgggc ctggagcttc gcatccgcca aaccggggac 


600 


agggtcacgg 


aagcattgca ggtcattacc gtggatgagg gagcgggaca agtgagggtg 


660 


ctgcactggg 


aggatggaac cgagccgggc ggcatagcca atgatcagta tcggtacagc 


720 


ctaaacgatc 


atcttggctc ctctttattg gaagttgacg ggcaaagtca gatcattagc 


780 


aaggaagaat 


tttatcccta tggcggcaca gcattgtgga cagcccggtc agaggtggag 


840 


gcaagctaca 


agaccacg 


858 



<210> 41 
<211> 286 
<212> PRT 

<213> Paenibacillus apairius strain DB4 82 
<400> 41 

lie Leu Ser lie Cys Asn Glu Ala Glu Pro Val Arg Tyr Phe Arg Asn 
15 10 15 

Gin Ala Val Ala Pro Lys Arg Gin Tyr Ala Tyr Asp Ala Leu Tyr Gin 
20 25 30 

Leu Val Ser Ser Ser Gly Arg Glu Ser Asp Ala Leu Arg Gin Gin Thr 
35 40 45 

Ser Leu Pro Pro Leu lie Thr Pro lie Pro Leu Asp Asp Ser Gin Tyr 
50 55 60 

Val Asn Tyr Ala Glu Arg Tyr Ser Tyr Asp Arg Ala Gly Asn Leu He 
65 70 75 80 

Lys Leu Ser His His Gly Ala Ser Gin Tyr Thr Thr Asn Val His Val 
85 90 95 



Asp Lys Ser Ser Asn Arg Gly He Trp Arg Gin Gly Glu Asp He Pro 
100 105 110 
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Asp He Ala Ala Ser Phe Asp Arg Ala Gly Asn Gin Gin Asp Leu Phe 
115 120 125 



Pro Gly Arg Arg Leu Glu Trp Asp Thr Arg Asn Gin Leu Cys Arg Val 
130 135 140 



His Met Val Val Arg Glu Gly Gly Asp Asn Asp Trp Glu Gly Tyr Leu 
145 150 155 160 



Tyr Asp Ser Ser Gly Met Arg He Val Lys His Ser Thr Arg Lys Thr 
165 170 175 



Gin Thr Thr Thr Gin Thr Asp Thr Thr He Tyr Leu Pro Gly Leu Glu 
180 185 190 



Leu Arg He Arg Gin Thr Gly Asp Arg Val Thr Glu Ala Leu Gin Val 
195 200 205 



He Thr Val Asp Glu Gly Ala Gly Gin Val Arg Val Leu His Trp Glu 
210 215 220 



Asp Gly Thr Glu Pro Gly Gly He Ala Asn Asp Gin Tyr Arg Tyr Ser 
225 230 235 240 



Leu Asn Asp His Leu Gly Ser Ser Leu Leu Glu Val Asp Gly Gin Ser 
245 250 255 



Gin He He Ser Lys Glu Glu Phe Tyr Pro Tyr Gly Gly Thr Ala Leu 
260 ~ 265 270 



Trp Thr Ala Arg Ser Glu Val Glu Ala Ser Tyr Lys Thr Thr 
275 280 285 



<210> 42 
<211> 4434 
<212> DNA 

<213> Photorhabdus strain W14 
<400> 42 

atgatgcaga attcacaaac attcagtgtt accgagctgt cattacccaa aggcggcggc 60 
gctattaccg gtatgggtga agcattaaca ccagccgggc cggatggtat ggccgcctta 120 
tccctgccat tacccatttc cgccgggcgt ggttacgcac cctcgctcac tctgaattac 180 
aacagtggaa ccggtaacag cccatttggt ctcggttggg actgcggcgt catggcaatt 240 
cgtcgtcgca ccagtaccgg cgtaccgaat tacgatgaaa ccgatacttt tctggggccg 300 
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gaaggtgaag tgttggtcgt agcattaaat gaggcaggtc aagctgatat ccgcagtgaa 360 

tcctcattgc agggcatcaa tttgggtgcg accttcaccg ttacctgtta tcgctcccgc 420 

ctagaaagcc actttaaccg gttggaatac tggcaacccc aaacaaccgg cgcaaccgat 480 

ttctggctga tatacagccc cgacggacag gtccatttac tgggcaaaaa tcctcaggca 54 0 

cgtatcagca atccactcaa tgttaaccaa acagcgcaat ggctgttgga agcctcgata 600 

tcatcccaca gcgaacagat ttattatcaa tatcgcgctg aagatgaagc aggttgtgaa 660 

accgacgagc tagcagccca ccccagcgca accgttcagc gctacctgca aacagtacat 720 

tacgggaacc tgaccgccag cgacgttttt cctacactaa acggagatga cccacttaaa 780 

tctggctgga tgttctgttt agtatttgac tacggtgagc gcaaaaacag cttatctgaa 840 

atgccgctgt ttaaagccac aggcaattgg ctttgccgaa aagaccgttt ttcccgttat 900 

gagtacggtt ttgaattgcg tactcgccgc ttatgccgcc aaatactgat gtttcaccgt 960 

ctacaaaccc tatctggtca ggcaaagggg gatgatgaac ctgcgctagt gtcgcgtctg 1020 

atactggatt atgacgaaaa cgcgatggtc agtacgctcg tttctgtccg ccgggtaggc 1080 

catgaggaca acaacacggt taccgcgctg ccaccactgg aactggccta tcagcctttt 1140 

gagccagaac aaaccgcact ctggcaatca atggatgtac tggcaaattt caacaccatt 1200 

cagcgctggc aactgcttga cctgaaagga gaaggcgtgc ccggcattct ctatcaggat 1260 

agaaatggct ggtggtatcg atctgcccaa cgtcaggccg gggaagagat gaatgcggtc 1320 

acctggggga aaatgcaact ccttcccatc acaccagctg tgcaggataa cgcctcactg 1380 

atggatatta acggtgacgg gcaactggac tgggtgatta ccgggccggg gctaaggggc 144 0 

tatcacagcc aacacccgga tggcagttgg acgcgtttta cgccattaca tgccctgccg 1500 

atagaatatt ctcatcctcg cgctcaactt gccgatttaa tgggagccgg gctgtccgat 1560 

ttagtgctaa ttggtcccaa aagtgtgcgc ttatatgtca ataaccgtga tggttttacc 1620 

gaagggcggg atgtggtgca atccggtgat atcaccctgc cgctaccggg cgccgatgcc 1680 

cgtaagttag tggcatttag tgacgtactg ggttcaggcc aagcacatct ggttgaagtt 174 0 

agtgcaactc aagtcacctg ctggccgaat ctggggcatg gccgttttgg tcagccaatc 1800 

gtattgccgg gattcagcca atctgccgcc agttttaatc ctgatcgagt tcatctggcc 1860 

gatttggatg ggagcggccc tgccgatttg atttatgttc atgctgaccg tctggatatt 1920 

ttcagcaatg aaagtggcaa cggttttgca aaaccattca cactctcttt tcctgacggc 1980 

ctgcgttttg atgatacctg ccagttgcaa gtagccgatg tacaagggtt aggcgttgtc 204 0 

agcctgatcc taagcgtacc gcatatggcg ccacatcatt ggcgctgcga tctgaccaac 2100 

gcgaaaccgt ggttactcag tgaaacgaac aacaatatgg gggccaatca caccttgcat 2160 
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taccgtagct 


ctgtccagtt 


ctggctggat 


gaaaaagctg 


cggcattggc 


taccggacaa 


2220 


acaccggtct 


gttacctgcc 


cttcccggtc 


catacccttt 


ggcaaacaga 


aaccgaggat 


2280 


gaaatcagcg 


gcaataagtt 


agtgaccacg 


ttacgttatg 


ctcacggcgc 


ttgggatgga 


2340 


cgtgaacggg 


aatttcgtgg 


ctttggttat 


gttgagcaga 


cagacagcca 


tcaactcgct 


2400 


caaggcaatg 


cgccggaacg 


tacaccaccg 


gcactcacca 


aaagctggta 


tgccaccgga 


2460 


ttacctgcgg 


tagataatgc 


gttatccgcc 


gggtattggc 


gtggcgataa 


gcaagctttc 


2520 


gccggtttta 


cgccacgttt 


tactctctgg 


aaagagggca 


aagatgttcc 


actgacaccg 


2560 


gaagatgacc 


ataatctata 


ctggttaaac 


cgggcgctaa 


aaggtcagcc 


actgcgtagt 


2640 


gaactctacg 


ggctggatgg 


cagcgcacag 


caacagatcc 


cctatacagt 


gactgaatcc 


2700 


cgtccacagg 


tgcgccaatt 


acaagatggc 


gccaccgttt 


ccccggtgct 


ctgggcctca 


2760 


gtcgtggaaa 


gccgtagtta 


tcactacgaa 


cgtattatca 


gtgatcccca 


gtgcaatcag 


2820 


gatatcacgt 


tgtccagtga 


cctattcggg 


caaccactga 


aacaggtttc 


cgtacaatat 


2880 


ccccgccgca 


acaaaccaac 


aaccaatccg 


tatcccgata 


ccctaccgga 


tacgctgttt 


2940 


gccagcagtt 


atgacgatca 


acaacagcta 


ttgcgattaa 


cctgccgaca 


atccagttgg 


3000 


caccatctta 


ttggtaatga 


gctaagagtg 


ttgggattac 


cggatggcac 


acgcagtgat 


3060 


gcctttactt 


acgatgccaa 


acaggtacct 


gtcgatggct 


taaatctgga 


aaccctgtgt 


3120 


gctgaaaata 


gcctgattgc 


cgatgataaa 


cctcgcgaat 


acctcaatca 


gcaacgaacg 


3180 


ttctataccg 


acgggaaaaa 


ccaaacaccg 


ctgaaaacac 


cgacacgaca 


agcgttaatc 


3240 


gcctttaccg 


aaacggcggt 


attaacggaa 


tctctgttat 


ccgcgtttga 


tggcggtatt 


3300 


acgccagacg 


aattaccggg 


aatactgaca 


caggccggat 


accaacaaga 


gccttatctg 


3360 


tttccacgca 


ccggcgaaaa 


caaagtttgg 


gtagcgcgtc 


aaggctatac 


cgattacggg 


3420 


acggaagcac 


aattttggcg 


tcctgtcgca 


caacgtaaca 


gcctgttaac 


cgggaaaatg 


3480 


acgttaaaat 


gggatactca 


ctattgtgtc 


atcacccaaa 


cccaagatgc 


tgccggcctc 


3540 


accgtctcag 


ccaattatga 


ctggcgtttt 


ctcacaccaa 


cgcaactgac 


tgacatcaac 


3600 


gataatgtgc 


atctcatcac 


cttggatgct 


ctgggacgcc 


ctgtcacgca 


acgtttctgg 


3660 


gggatcgaaa 


gcggtgtggc 


aacaggttac 


tcttcatcag 


aagaaaaacc 


attctctcca 


3720 


ccaaacgata 


tcgataccgc 


tattaatcta 


accggaccac 


tccctgtcgc 


acagtgtctg 


3780 


gtctatgcac 


cggacagttg 


gatgccacta 


ttcagtcaag 


aaaccttcaa 


cacattaacg 


3840 


caggaagagc 


aggagacgct 


gcgtgattca 


cgtattatca 


cggaagattg 


gcgtatttgc 


3900 


gcactgactc 


gccgccgttg 


gctacaaagt 


caaaagatca 


gtacaccatt 


agttaaactg 


3960 
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ttaaccaaca gcattggttt acctccccat aaccttacgc tgaccacaga ccgttatgac 4020 

cgcgactctg agcagcaaat tcgccaacaa gtcgcattta gtgatggttt tggccgtctg 4080 

ctacaagcgt ctgtacgaca tgaggcaggc gaagcctggc aacgtaacca agacggttct 4140 

ctggtgacaa aagtggagaa taccaaaacg cgttgggcgg tcacgggacg caccgaatat 4200 

gataataaag ggcaaacgat acgcacttat cagccctatt tcctcaacga ctggcgatat 4260 

gtcagtgatg acagcgccag aaaagaagcc tatgcggata ctcatattta tgatccaatt 4320 

gggcgagaaa tccgggttat tactgcaaaa ggctggctgc gccaaagcca atatttcccg 4380 

tggtttaccg tgagtgagga tgagaatgat acggccgctg atgcgctggt gtaa 4434 

<210> 43 
<211> 4425 
<212> DNA 

<213> Photorhabdus strain W14 
<400> 43 

atgcaaaatt cacaagattt tagtattacg gaactgtcac tgcccaaagg ggggggcgct 60 

atcacgggaa tgggtgaagc attaaccccc actggaccgg atggtatggc cgcgctatct 12 0 

ctaccattgc ctatttctgc cgggcgcggt tatgctcccg cattcactct gaattacaac 180 

agcggcgccg gtaacagtcc atttggtctg ggttgggatt gcaacgttat gactatccgc 24 0 

cgccgcaccc attttggcgt cccccattat gacgaaaccg ataccttttt ggggccagaa 300 

ggcgaagtgc tggtggtagc ggatcaacct cgcgacgaat ccacattaca gggtatcaat 360 

ttaggcgcca cctttaccgt taccggctac cgttcccgtc tggaaagcca tttcagccga 420 

ttggaatatt ggcaacccaa aacaacaggt aaaacagatt tttggttgat atatagccca 480 

gatgggcagg tgcatctact gggtaaatca ccgcaagcgc ggatcagcaa cccatcccaa 540 

acgacacaaa cagcacaatg gctgctggaa gcctctgtat catcacgtgg cgaacaaatt 600 

tattatcaat atcgcgccga agatgacaca ggttgcgaag cagatgaaat tacgcaccat 660 

ttacaggcta cagcgcaacg ttatttacac atcgtgtatt acggcaaccg tacagccagc 720 

gaaacattac ccggtctgga tggcagcgcc ccatcacaag cagactggtt gttctatctg 780 

gtatttgatt acggcgaacg cagtaacaac ctgaaaacgc caccagcatt ttcgactaca 840 

ggtagctggc tttgccgtca ggaccgtttt tcccgttatg aatatggctt tgagattcgt 900 

acccgccgct tatgccgtca ggtattgatg taccatcacc tgcaagcact ggatagtaag 960. 

ataacagaac acaacggacc aacgctggtt tcacgcctga tactcaatta cgacgaaagc 1020 

gcgatagcca gcacgctagt attcgttcgc cgagtgggac acgagcaaga tggtaatgtc 1080 

gtcaccctgc cgccattaga attggcatat caggattttt caccgcgaca tcacgctcac 1140 



i 
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tggcaaccaa 


tggatgtact 


ggcaaacttc 


aatgccattc 


agcgctggca 


gctagtcgat 


1200 


ctaaaaggcg 


aaggattacc 


cggcctgtta 


tatcaggata 


aaggcgcttg 


gtggtaccgc 


1260 


tccgcacagc 


gtctgggcga 


aattggctca 


gatgccgtca 


cttgggaaaa 


gatgcaacct 


1320 


ttatcggtta 


ttccttcttt 


gcaaagtaat 


gcctcgttgg 


tggatatcaa 


tggagacggc 


1380 


caacttgact 


gggttatcac 


cggaccggga 


ttacggggat 


atcatagtca 


acgcccggat 


1440 


ggcagttgga 


cacgttttac 


cccactcaac 


gctctgccgg 


tggaatacac 


ccatccacgc 


1500 


gcgcaacucg 


cagatttaat 


gggagccggg 


ctatccgatt 


tggtgctgat 


cggccctaag 


1560 


agcgtgcgtt 


tatatgccaa 


tacccgcgac 


ggctttgcca 


aaggaaaaga 


tgtggtgcaa 


1620 


tccggtgata 


tcacactgcc 


ggtgccgggc 


gccgatccac 


gtaagttggt 


ggcgtttagt 


1680 


gatgtattgg 


gttcaggtca 


agcccatctg 


gttgaagtaa 


gcgcgactaa 


agtcacctgc 


1740 


tggcctaatc 


tggggcgcgg 


acgttttggt 


caacccatta 


ccttaccggg 


attcagccag 


1800 


ccagcaaccg 


agtttaaccc 


ggctcaagtt 


tatctggccg 


atctggatgg 


cagcggtcca 


1860 


acggatctga 


tttatgttca 


tacaaaccgt 


ctggatatct 


tcctgaacaa 


aagtggcaat 


1920 


ggctttgctg 


aaccagtgac 


attacgcttc 


ccggaaggtc 


tgcgttttga 


tcatacctgt 


1980 


cagttacaaa 


tggccgatgt 


acaaggatta 


ggcgtcgcca 


gcctgatact 


gagcgtgccg 


2040 


catatgtctc 


cccatcactg 


gcgctgcgat 


ctgaccaaca 


tgaagccgtg 


gttactcaat 


2100 


gaaatgaaca 


acaatatggg 


ggtccatcac 


accttgcgtt 


accgcagttc 


ctcccaattc 


2160 


tggctggatg 


aaaaagccgc 


ggcgctgact 


accggacaaa 


caccggtttg 


ctatctcccc 


2220 


ttcccgatcc 


acaccctatg 


gcaaacggaa 


acagaagatg 


aaatcagcgg 


caacaaatta 


2280 


gtcacaacac 


ttcgttatgc 


tcgtggcgca 


tgggacggac 


gcgagcggga 


atttcgcgga 


2340 


tttggttatg 


tagagcagac 


agacagccat 


caactggctc 


aaggcaacgc 


gccagaacgt 


2400 


acgccaccgg 


cgctgaccaa 


aaactggtat 


gccaccggac 


tgccggtgat 


agataacgca 


2460 


ttatcaaccg 


agtattggcg 


tgatgatcag 


gcttttgccg 


gtttctcacc 


gcgctttacg 


2520 


acttggcaag 


ataacaaaga 


tgtcccgtta 


acaccggaag 


atgataacag 


tcgttactgg 


2580 


ttcaaccgcg 


cgttgaaagg 


tcaactgcta 


cgtagtgaac 


tgtacggatt 


ggacgatagt 


2640 


acaaataaac 


acgttcccta 


tactgtcact 


gaatttcgtt 


cacaggtacg 


tcgattacag 


2700 


cataccgaca 


gccgataccc 


tgtactttgg 


tcatctgtag 


ttgaaagccg 


caactatcac 


2760 


tacgaacgta 


tcgccagcga 


cccgcaatgc 


agtcaaaata 


ttacgctatc 


cagtgatcga 


2820 


tttggtcagc 


cgctaaaaca 


gctttcggta 


cagtacccgc 


gccgccagca 


gccagcaatc 


2880 


aatctgtatc 


ctgatacatt 


gcctgataag 


ttgttagcca 


acagctatga 


tgaccaacaa 


2940 


cgccaattac 


ggctcaccta 


tcaacaatcc 


agttggcatc 


acctgaccaa 


caataccgtt 


3000 
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cgagtattgg gattaccgga tagtacccgc agtgatatct ttacttatgg cgctgaaaat 3060 
gtgcctgctg gtggtttaaa tctggaactt ctgagtgata aaaatagcct gatcgcggac 3120 
gataaaccac gtgaatacct cggtcagcaa aaaaccgctt ataccgatgg acaaaataca 3180 
acgccgttgc aaacaccaac acggcaagcc ctgattgcct ttaccgaaac aacggtattc 3240 
aaccagtcca cattatcagc gtttaacgga agcatcccgt ccgataaatt atcaacgacg 3300 
ctggagcaag ctggatatca gcaaacaaat tatctattcc ctcgcactgg agaagataaa 3360 
gttfcgggtag cccatcacgg ctataccgat tatggtacag cggcacagtt ctggcgcccg 3420 
caaaaacaga gcaacaccca actcaccggt aaaatcaccc tcatctggga tgcaaactat 3480 

tgcgttgtgg tacaaacccg ggatgctgct ggactgacaa cctcagccaa atatgactgg 3540 

cgttttctga ccccggtgca actcaccgat atcaatgaca atcagcacct tatcacactg 3600 

gatgcattgg gccgaccaat cacattgcgc ttttggggaa ctgaaaacgg caagatgaca 3660 

ggttattcct caccggaaaa agcatcattt tctccaccat ccgatgttaa tgccgctatt 3720 

gagttaaaaa aaccgctccc tgtagcacag tgtcaggtct acgcaccaga aagctggatg 3780 

ccagtattaa gtcagaaaac cttcaatcga ctggcagaac aagattggca aaagttatat 3840 

aacgcccgaa tcatcaccga agatggacgt atctgcacac tggcttatcg ccgctgggta 3900 

caaagccaaa aggcaatccc tcaactcatt agcctgttaa acaacggacc ccgtttacct 3960 

cctcacagcc tgacattgac gacggatcgt tatgatcacg atcctgagca acagatccgt 4020 

caacaggtgg tattcagtga tggctttggc cgcttgctgc aagccgctgc ccgacatgag 4 080 

gcaggcatgg cccggcaacg caatgaagac ggctctttga ttataaatgt ccagcatact 4140 

gagaaccgtt gggcagtgac tggacgaacg gaatatgaca ataaggggca accgatacgt 4200 

acctatcagc cctatttcct caatgactgg cgatacgtca gcaatgatag tgcccggcag 4260 

gaaaaagaag cttatgcaga tacccatgtc tatgatccca taggtcgaga aatcaaggtt 4320 

atcaccgcaa aaggfctggtt ccgtcgaacc ttgttcactc cctggtttac tgtcaatgaa 4380 

gatgaaaatg acacagccgc tgaggtgaag aaggtaaaga tgtaa 4425 



<210> 44 
<211> 3132 
<212> DNA 

<213> Photorhabdus strain W14 
<400> 44 

atgagcccgt ctgagactac tctttatact caaaccccaa cagtcagcgt gttagataat 
cgcggtctgt ccattcgtga tattggtttt caccgtattg taatcggggg ggatactgac 
acccgcgtca cccgtcacca gtatgatgcc cgtggacacc tgaactacag tattgaccca 



60 
120 
180 
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cgcttgtatg 


atgcaaagca 


ggctgataac tcagtaaagc 


ctaattttgt 


ctggcagcat 


240 


gatctggccg 


gtcatgccct 


gcggacagag agtgtcgatg 


ctggtcgtac 


tgttgcattg 


300 


aatgatattg 


aaggtcgttc 


ggtaatgaca atgaatgcga 


ccggtgttcg 


tcagacccgt 


360 


cgctatgaag 


gcaacacctt 


gcccggtcgc ttgttatctg 


tgagcgagca 


agttttcaac 


420 


caagagagtg 


ctaaagtgac 


agagcgcttt atctgggctg 


ggaatacaac 


ctcggagaaa 


480 


gagtataacc 


tctccggtct 


gtgtatacgc cactacgaca 


cagcgggagt 


gacccggttg 


540 


atgagtcagt 


cactggcggg 


cgccatgcta tcccaatctc 


accaattgct 


ggcggaaggg 


600 


caggaggcta 


actggagcgg 


tgacgacgaa actgtctggc 


agggaatgct 


ggcaagtgag 


660 


gtctatacga 


cacaaagtac 


cactaatgcc atcggggctt 


tactgaccca aaccgatgcg 


720 


aaaggcaata 


ttcagcgtct 


ggcttatgac attgccggtc 


agttaaaagg gagttggttg 


780 


acggtgaaag 


gccagagtga 


acaggtgatt gttaagtccc 


tgagctggtc agccgcaggt 


840 


cataaattgc 


gtgaagagca 


cggtaacggc gtggttacgg 


agtacagtta 


tgagccggaa 


900 


actcaacgtc 


tgataggtat 


caccacccgg cgtgccgaag 


ggagtcaatc 


aggagccaga 


960 


gtattgcagg 


atctacgcta 


taagtatgat ccggtgggga 


atgttatcag 


tatccataat 


1020 


gatgccgaag 


ctacccgctt 


ttggcgtaat cagaaagtgg 


agccggagaa 


tcgctatgtt 


1080 


tatgattctc 


tgtatcagct 


tatgagtgcg acagggcgtg 


aaatggctaa 


tatcggtcag 


1140 


caaagcaacc 


aacttccctc 


acccgttata cctgttccta 


ctgacgacag 


cacttatacc 


1200 


aattaccttc 


gtacctatac 


ttatgaccgt ggcggtaatt 


tggttcaaat 


ccgacacagt 


1260 


tcacccgcga 


ctcaaaatag 


ttacaccaca gatatcaccg 


tttcaagccg 


cagtaaccgg 


1320 


gcggtattga 


gtacattaac 


gacagatcca acccgagtgg 


atgcgctatt 


tgattccggc 


1380 


ggtcatcaga 


agatgttaat 


accggggcaa aatctggatt 


ggaatattcg gggtgaattg 


1440 


caacgagtca 


caccggtgag 


ccgtgaaaat agcagtgaca 


gtgaatggta tcgctatagc 


1500 


agtgatggca 


tgcggctgct 


aaaagtgagt gaacagcaga 


cgggcaacag tactcaagta 


1560 


caacgggtga 


cttatctgcc 


gggattagag ctacggacaa 


ctggggttgc agataaaaca 


1620 


accgaagatt 


tgcaggtgat 


tacggtaggt gaagcgggtc 


gcgcacaggt 


aagggtattg 


1680 


cactgggaaa 


gtggtaagcc 


gacagatatt gacaacaatc 


aggtgcgcta 


cagctacgat 


1740 


aatctgcttg 


gctccagcca 


gcttgaactg gatagcgaag 


ggcagattct 


cagtcaggaa 


1800 


gagtattatc 


cgtatggcgg 


tacggcgata tgggcggcga 


gaaatcagac 


agaagccagc 


1860 


tacaaattta 


ttcgttactc 


cggtaaagag cgggatgcca 


ctggattgta 


ttattacggc 


1920 


taccgttatt 


atcaaccttg 


ggtgggtcga tggttgagtg 


ctgatccggc gggaaccgtg 


1980 
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gatgggctga 
ggattagcac 
cgtaaacctg 
attgccggcg 
gctatccccg 
tataacgtcg 
aaatcgacgt 
tatggcgcac 
gtgggatcat 
gcggtaggca 
gcagcggcgg 
ggtatccatg 
gctagtaacc 
gctgaaatgg 
gttgtcagcc 
cctgtctttg 
ctgcacagag 
ggtatagatc 
accgggatcg 
ttttctttgt 



atttgtaccg 
cgtctccaaa 
atgagggaat 
ggattgcgat 
tcattctggg 
gtagcctgct 
tagtacagtc 
gggcacaagg 
ggataaataa 
ccattgatac 
9tggggcggc 
ccggtattgg 
ccgccggaca 
ctgtcaacag 
catatgccgc 
agccgatatt 
tgatgggaag 
atgtcgctgg 
ctaatgcgat 



aatggtgagg 
tagaaatcga 
gtccgcgtca 
tggcggtctt 
ggttgcggcc 
ggaaaaaggc 
ggcggcfcggc 
tgtcggtgtt 
tgctgatcgg 
tatgttaggg 
gggtgggatg 
cacctattat 
tttagcgaat 
aataatgggt 
cggtttagcc 
tagtgttctc 
agagagttgg 
catgattggt 
agactatggc 



97 

aataacccca 



aatacatttt 
atgagacggg 
gcggctacca 
gtaggcgcgg 

ggggcattac 

gcggctgccg 
gcatcagccg 
gggattggcg 
actgcctcta 
atcaccggta 
ggctcctgga 
tacgcagtgg 
ggtggatttt 
agacaattag 
ggcgggcttg 
atttccagag 
aatcagatca 
accagtgctg 



tcacattgac 
ggtttgcttc 
gacaaaaaat 
ttgccgctac 
ggattggcgc 
ttgctcgact 
gagcgagttc 
ccggggcggt 
gcgctattgg 
cccttaccca 
cgcaagggag 
ttggttttgg 
gttatgccgc 
tgagtaggct 
tacatttcag 
tcggtggtat 
cgttaagtgc 
gaggcagggt 
tgggagccgc 



tgaccatgac 
atttttgttt 
tggcagagcc 
ggctggcgcg 
gttgatggga 
cgtacagggg 
agccgcggct 
aacaggggct 
ggccgggagt 
tgaagtcggg 
tactcgggca 
tttagatgtc 
tggtttgggt 
cttaggccgg 
tgtcgccaga 
tggaactggc 
tgccggtagt 
cttgaccaca 
acgacgagtt 



aa 



<210> 45 
<211> 2748 
<212> DNA 

<213> Photorhabdus strain W14 
<400> 45 

atgagcagtt acaattctgc aattgaccaa aagaccccct 
aggaaattaa atgtacgtac tttagaatat ctacgcactc 
gaattaatta cgttctatga gttcaatatt ccgggatttc 
cgtaaaaata aaaaccagag cggcccaaat ttcattcgtg 
gttttacgtg aagaaagtgt tgatgccggt cggactatta 
cgcccggtgt tgatcatcaa tgcaaccggt gtccgccaaa 
acccttcccg gtcgtctgct cgctatcacc gaacaagtac 



cgattaaggt 
aagctgacga 
aggtaaaaag 
tctttaatct 
ccctcaatga 
accatcgtta 
aggcaggaga 



attagataac 
aaacagtgat 
caccgatcct 
tgccggtcaa 
tattgaaagt 
tgaagataac 
gaaaacgacc 



2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3132 



60 
120 
180 
240 
300 
360 
420 



V 



WO 2004/002223 



POYUS2003/020082 



98 



gaacgtctta 


tctgggccgg 


caatacgccg caagaaaaag attacaacct 


cgccggtcag 


480 


tgtgtccgcc 


attacgatac 


cgcgggactt actcaactca atagcctttc 


tctggctggc 


54 0 


at cat ac tat 


cacaatctca 


gcaactactc gtcgatgata aaaatgctga 


ctggacaggt 


600 


gaagaccaaa 


acctctaaca 


gcaaaaactg agcagtgatg tctataccac 


ccaaaataaa 


660 


accaatacca 


c caaaac 1 1 1 


attgacccag accgatgcca aaggcaacat 


ccagcgtctg 


720 


occtacaaca 


tacccoaaca 


gctaaaaggc tgttggttga cactcaaagg 


tcaqgccqaq 


780 


caaataatta 


tcaaafccacc 


ydCctaCtcc gccgccggac aaaaattacg 


cciaagaonaf! 
" — — _/ 


84 0 


yyu.ctctL.yyyy 


ttatcactaa 


atacaactat aaaccaaaaa cccaacaact 


tatcggtatt 


900 


£Tf , r , ar*f*r , ar' , r* 

MVrfUQL L_r v*r ^ 


at cccrtcaaa 


cgccaaagtg ttgcaagact tacgctatca 


atatgacccg 


960 


y ^^yy v^oa L.y 


taatcaatat 


ccgtaatgat gcggaagcca cccgcttttg 


gcgcaatcag 


1020 


oca. ay cyy luu 


pooaaaahao 


ctatacefcac aactccctat atcaacttat 


cagtgccacc 


1080 


yyy ^-yy y aac * 


fc aar*t* a a t at" 


aaotcaacaa aataaccaac baccctcccc 


tgcgctacct 


114 0 


f- r»1~ as r*a a ra 


atacctacac 


taactatact cacaactaca actataatca 


cagtggtaat 


1200 


i— y Q.\— y oaaa 


1 1 - caacacaa 


ctcoccaact acccaaaaca actacaccat 


ggctatcacc 


1260 


t»l_L»L.t_aaaV_l~ 


crraacaatca 

y w«ay i-c»o i»v*y 


aaatattctc aatacoctaa ccacccratcc 


aaatcaagtg 


1320 


yai-ai—y l. tyu 


ttoataccacr 

u. u.ycn-yL.uyy 


tacrtcaccaa accacrtttat tacccacraca 


gacacttatc 


1380 


t" nna ra rra c 
uyya wat-v-at. 


na crna oa cr 1 1~ 


aaaacaaatt aataataacc caaoaaataa 


ataataccac 

3 «-33 >.»^v»3k. 


1440 


l. ci i—y a l, ay i_a 


a ronra t* cracr 


araactaaaa ataaotaaac aoccaaccca 


aaatactaca 


1500 


ranraaraar 


aaat aa t r*t a 

yyyuaaui-ua 


tttaccoaaa ctooaoctac acacaaccca 


gagcaacgcc 


1560 


araaraarao 


aaaaaftaca 


cgttatcaca ctcggtgaag ccggtcgcgc 


acaggtacgg 


1620 


Qfal" f- nfa r»t" 
y u L- ^ c* i- 


aaaa op aa 
yyy a y°y u yy 


taagccagaa gatgtcaaca ataatcaact 


acgttacagc 


1680 


tacgataatc 


tgatcggctc 


cagccagctt gaactggaca accaaggaca 


aattatcagc 


1740 


aaaaaaoaot 


attatccatt 


tggcgggaca gcgctgtggg cagcaaacag 


ccaaacagaa 


1800 


gccagctata 


aaacgattcg 


ctattccggc aaagaacgag atgccaccgg 


gttgtattat 


1860 


tacggttatc 


gttattacca 


accgtgggcg ggcagatggt taagcgcgga 


cccggcagga 


1920 


accattgatg 


ggctgaatct 


ataccgaatg gtaagaaata atcctgtgag 


tttacaagat 


1980 


gaaaatggat 


tagcgccaga 


aaaagggaaa tataccaaag aggtaaattt 


ctttgatgaa 


2040 


ttaaaattca 


aattggcagc 


caaaagttca catgttgtca aatggaacga 


gaaagagagc 


2100 


agttatacaa 


aaaataaatc 


attgaaagtg gttcgtgtcg gtgattccga 


tccgtcgggt 


2160 


tatttgctaa 


gccacgaaga 


gttactaaaa ggtatagaaa aaagtcaaat 


catatatagc 


2220 


cgacttgaag 


aaaacagctc 


cctttcagaa aaatcaaaaa cgaatctttc 


tttaggatct 


22B0 
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gaaatatccg gttatatggc aagaaccata caagatacga tatcagaata tgccgaagag 2340 

cataaatata gaagtaatca ccctgatttt tattcagaaa ccgatttctt tgcgttaatg 2400 

gataaaagtg aaaaaaatga ttattccggt gaaagaaaaa tttatgcggc aatggaggtt 2460 

aaggtttatc atgatttaaa aaataaacaa tcagaattac atgtcaacta tgcattggcc 2520 

catccctata cgcaattgag taatgaagaa agagcgctgt tgcaagaaac agaacccgct 2580 

attgcaatag atagagaata taatttcaaa ggtgttggca aattcctgac aatgaaagca 2640 

attaaaaaat cattgaaagg acataaaatt aataggatat caacagaggc tattaatatt 2700 

cgctctgcgg ctatcgctga gaatttagga atgcggagaa cttcataa 2748 

<210> 46 
<211> 2883 
<212> DNA 

<213> Photorhabdus strain W14 
<400> 46 

atgaaaaaca ttgatcccaa actttatcaa aaaaccccta ctgtcagcgt ttacgataac 60 

cgtggtctga taatccgtaa catcgatttt catcgtacta ccgcaaatgg tgatcccgat 120 

acccgtatta cccgccatca atacgatatt cacggacacc taaatcaaag catcgatccg 180 

cgcctatatg aagccaagca aaccaacaat acgatcaaac ccaattttct ttggcagtat 240 

gatttgaccg gtaatcccct atgtacagag agcattgatg caggtcgcac tgtcaccttg 300 

aatgatattg aaggccgtcc gctactaacg gtgactgcaa caggggttat acaaactcga 360 

caatatgaaa cttcttccct gcccggtcgt ctgttatctg ttgccgaaca aacacccgag 420 

gaaaaaacat cccgtatcac cgaacgcctg atttgggctg gcaataccga agcagagaaa 480 

gaccataacc ttgccggcca gtgcgtgcgt cactatgaca cggcgggagt tacccggtta ,540 

gagagtttat cactgaccgg tactgtttta tctcaatcca gccaactatt gatcgacact 600 

caagaggcaa actggacagg tgataacgaa accgtctggc aaaacatgct ggctgatgac 660 

atctacacaa ccctgagcac cttcgatgcc accggtgctt tactgactca gaccgatgcg 720 

aaagggaaca ttcagagact ggcttatgat gtggccgggc agctaaacgg gagctggcta 780 

acactcaaag gccagacgga acaagtgatt atcaaatccc tgacctactc cgccgccgga 840 

caaaaattac gtgaggaaca cggcaatgat gttatcaccg aatacagtta tgaaccggaa 900 

acccaacggc tgatcggtat caaaacccgc cgtccgtcag acactaaagt gctacaagac 960 

ctgcgctatg aatatgaccc ggtaggcaat gtcatcagca tccgtaatga cgcggaagcc 1020 

acccgctttt ggcacaatca gaaagtgatg ccggaaaaca cttataccta cgattccctg 1080 

tatcagctta tcagcgccac cgggcgcgaa atggcgaata taggtcaaea aagtcaccaa 114 0 
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100 



tttccctcac 


ccgctctacc ttctgataac 


aacacctata 


ccaactatac 


ccgtacttat 


1200 


acttatgacc 


gtggcggcaa tctgaccaaa 


atccagcaca 


gttcaccggc 


gacgcaaaac 


1260 


aactacacca 


ccaatatcac ggtttcaaat 


cgcagcaacc 


gcgcagtact 


cagcacattg 


1320 


accgaagatc 


cggcgcaagt agatgctttg 


tttgatgcag 


gcggacatca 


gaacaccttg 


1380 


atatcaggac 


aaaacctgaa ctggaatact 


cgtggtgaac 


tgcaacaagt 


aacactggtt 


1440 


aaacgggaca 


agggcgccaa tgatgatcgg 


gaatggtatc 


gttatagcgg 


tgacggaaga 


1500 


aggatgttaa 


aaatcaatga acagcaggcc 


agcaacaacg 


ctcaaacaca 


acgtgtgact 


1560 


tatttgccga 


acttagaact tcgtctaaca 


caaaacagca 


cggccacaac 


cgaagatttg 


1620 


caagttatca 


ccgtaggcga agcgggccgg 


gcacaggtac 


gagtattaca 


ttgggagagc 


1680 


ggtaaaccgg 


aagatatcga caataatcag 


ttgcgttata 


gttacgataa 


tcttatcggt 


1740 


tccagtcaac 


ttgaattaga tagcgaagga 


caaattatca 


gtgaagaaga 


atattatccc 


1800 


tatggtggaa 


cagcattatg ggccgccagg 


aatcagacag 


aagccagtta 


taaaaatatc 


1860 


cgttattcag 


gcaaagagcg ggatgccacc 


gggctatatt 


actacggcta 


tcggtattac 


1920 


caaccgtgga 


taggacggtg gttaagctcc 


gatccggcag 


gaacaatcga 


tgggctgaat 


1980 


ttatatcgga 


tggtgaggaa taatccagtt 


accctccttg 


atcctgatgg 


attaatgcca 


2040 


acaattgcag 


aacgcatagc agcactaaaa 


aaaaataaag 


taacagactc 


agcgccttcg 


2100 


ccagcaaatg 


ccacaaacgt agcgataaac 


atccgcccgc 


ctgtagcacc 


aaaacctagc 


2160 


ttaccgaaag 


catcaacgag tagccaacca 


accacacacc 


ctatcggagc 


tgcaaacata 


2220 


aaaccaacga 


cgtctgggtc atctattgtt 


gctccattga 


gtccagtagg 


aaataaatct 


2280 


acttctgaaa 


tctctctgcc agaaagcgct 


caaagcagtt 


cttcaagcac 


tacctcgaca 


2340 


aatctacaga 


aaaaatcatt tactttatat 


agagcagata 


acagatcctt 


tgaagaaatg 


2400 


caaagtaaat 


tccctgaagg atttaaagcc 


tggactcctc 


tagacactaa 


gatggcaagg 


2460 


caatttgcta 


gtatctttat tggtcagaaa 


gatacatcta 


atttacctaa 


agaaacagtc 


2520 


aagaacataa 


gcacatgggg agcaaagcca 


aaactaaaag 


atctctcaaa 


ttacataaaa 


2580 


tataccaagg 


acaaatctac agtatgggtt 


tctactgcaa 


ttaatactga 


agcaggtgga 


2640 


caaagctcag 


gggctccact ccataaaatt 


gatatggatc 


tctacgagtt 


tgccattgat 


2700 


ggacaaaaac 


taaatccact accggagggt 


agaactaaaa 


acatggtacc 


ttccctttta 


2760 


ctcgacaccc 


cacaaataga gacatcatcc 


atcattgcac 


ttaatcatgg 


accggtaaat 


2820 


gatgcagaaa 


tttcatttct gacaacaatt 


ccgcttaaaa 


atgtaaaacc 


tcataagaga 


2880 


taa 










2883 
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<210> 47 
<211> 2850 
<212> DNA 

<213> Photorhabdus strain W14 
<400> 47 

atgaaaaaca ttgacccaaa actttatcaa catacgccca ccgttaacgt ctacgataac 60 

cgtggcctga ccattcgtaa catcgacttt caccgtgacg tcgcgggagg cgatacagat 12 0 

actcgtatta cccgccacca atatgatacc cgaggacact tgagccaaag cattgatcca 180 

cggctgtatg acgccaaaca aaccaataac tcgacaaacc ccaacttcct ctggcaatac 240 

aatctcaccg gcgacacttt gcggacagaa agtgtcgatg ccggccgtac cgtagccctc 300 

aatgatattg aaggccgtca agtgttgatt gtaaccgcaa ccggcgccat tcagacccga 360 

caatatgaag ccaataccct gcccggtcgt ctattatccg taagtgaaca agcccccgga 420 

gaacagactc cccgcgttac tgagcatttt atttgggctg gtaatacaca ggcggagaaa 480 

gatcataatc ttgccggcca gtatgtgcgc cactacgaca cagcaggagt gacgcaactg 540 

gaaagcctgt cattgacaga aaacatctta tctcaatccc gtcagttatt agccgacggt 600 

caggaagcag actggacagg taacgatgaa accctctggc agaccaaact caatagcgaa 660 

acttacacga cacaaagcac ctttgatgct accggcgctt tgctgaccca aaccgatgca 720 

aaaggcaaca tgcaacgtct ggcttacaac gtggcaggac aattacaagg tagctggctg 780 

acattgaaaa accaaagtga gcaagtcatt gtcaaatccc tgacctattc cgccgcaggc 840 

cagaaattgc gtgaagaaca cggtaatggc gttatcactg aatacagcta tgaaccggaa 900 

actctacgat tgatcggtac cactactcgc cgtcaatcag atagcaaggt gttacaagat 960 

ctacgctatg aacatgatcc tgtagggaat attattagtg tccgtaatga tgcagaagcc 1020 

acccgcttct ggcgcaatca gaaaatagtc cctgaaaata cctacaccta cgattccctg 1080 

tatcagctta tcagtgcaac aggacgtgag atggctaaca tcggccagca aagcaaccaa 114 0 

cttccttcgc caatcatccc tcttcctact gatgaaaact catataccaa ctatactcgc 1200 

agctataatt acgatcgcgg cggcaatttg gttcaaatcc ggcacagttc ccccgccgcc 1260 

caaaataact acaccacaga tatcaccgtt tcgaatcgca gtaaccgggc agtgctgagt 1320 

tcgctaacct cagacccaac acaggtggag gcactgtttg atgccggcgg acatcaaaca 1380 

aaattgttac cggggcaaga gctgagttgg aatacacgag gtgaactaaa acaggtaacg 1440 

ccagtcagtc gcgagagcgc cagcgatcgg gaatggtatc gttacggcaa cgacggcatg 1500 

cgacggttaa aagtcagtga gcaacagact ggcaacagca cgcagcagca acgagtaact 1560 

tatcttcccg atctggagct acgtacaaca caaaatggga ctactacatc agaagacctg 1620 
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catgctatta 


ccgtgggagc 


agcaggccac 


102 
gcacaagtgc 


gagttctaca 


ctgggaaact 


1680 


acgccaccag 


ccggtatcaa 


taacaatcag 


cttcgctata 


gctatgataa 


tttgattggt 


1740 


tccagtcaac 


ttgaactgga 


taacgcagga 


caaattatca 


gtcaggaaga 


gtattatcca 


1800 


tttggcggca 


cagcattatg 


ggcagcaaga 


aaccaaatag 


aagccagcta 


caaaatcctc 


1860 


cgttactcag 


gtaaagaacg 


cgatgctacc 


gggctctatt 


attacggcta 


ccgctattat 


1920 


cagccgtggg 


ttggtaggtg 


gttaagcgcc 


gatccggctg 


gaacaatcga 


tggactgaat 


1980 


ctataccgga 


tggtgagaaa 


taatccgtca 


acactyyttg 


— i *-■»-*- n «. nr ~r 


gcttgcacct 


2040 


acgaaataca 


atattcccgg 


atttgacttt 


gatgtagaaa 


tagatgagca 


aaaaagatct 


2100 


aaattaaaac 


caacgttgat 


aagaatcaaa 


gatgaatttt 


tacattatgg 


tcctgtagat 


2160 


aagctgttag 


aagaaaaaaa 


acccggcctc 


aatgtaccag 


aggagctatt 


tgatagaggt 


2220 


ccatccgaga 


atggagtgtc 


aacattaact 


ttcaaaaaag 


acctaccgat 


aagttgtatt 


2280 


agcaacacag 


aatataccct 


tgatatctta 


tacaacaaac 


atgagactaa 


accattccct 


2340 


tacgaaaacg 


aagcaacagt 


tggcgcagat 


ctgggagtaa 


taatgtccgt 


ggagtttgga 


2400 


aataaatcaa 


taggtaatgc 


ctctgacgaa 


gatttaaaag 


aagaacatct 


cccattagga 


2460 


aaatccacaa 


tggataaaac 


agacctgcca 


gatttaaaac 


aagggctaat 


gatcgcggag 


2520 


aagataaaaa 


gtggaaaagg 


ggcatatcct 


tttcattttg 


gtgctgcaat 


agctgttgta 


2580 


tatggtgagg 


ataaaaaagt 


agccgcttca 


attctgacag 


arccaccuga 


dCCLdddaya 


Z Oft u 


gacgaaggcg 


agtatttgca 


aagtacgaga 


aaggtaagcg 


caatgtttat 


cacaaacgtc 


2700 


aatgaatttc 


gcggccatga 


ttacccaaaa 


agtaaatata 


gtatcggatt 


agttacagct 


2760 


gaaaaacgtc 


agccagtaat 


aagcaaaaaa 


cgtgcaaacc 


cggaagaggc 


cccttcatca 


2820 


tccagaaata 


aaaaattgca 


tgtacattaa 








2850 


<210> 48 
<211> 2817 
<212> DNA 

<213> Photorhabdus strain W14 










<400> 48 
atggaaaaca 


ttgacccaaa actttatcac 


catacgccta 


ccgtcagtgt 


tcacgataac 


60 


cgtggactag ctatccgtaa tattagtttt 


caccgcacta 


ccgcagaagc 


aaataccgat 


120 


acccgtatta 


cccgccatca atataatgcc 


ggcggatatt 


tgaaccaaag 


cattgatcct 


180 


cgcctgtatg acgccaaaca gactaacaac 


gctgtacaac 


cgaattttat 


ctggcgacat 


240 


aatttgaccg 


gcaatatcct 


gcgaacagag 


agcgtcgatg 


ccggtcggac 


gattaccctc 


300 


aacgatattg aaggccgccc ggtgttgacc 


atcaatgcag 


ccggtgtccg 


gcaaaaccat 


360 
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cgctacgaag ataacaccct gcccggtcgc ctgctcgcta tcagcgaaca aggacaggca 420 
gaagagaaaa cgaccgagcg ccttatctgg gccggcaata cgccgcaaga aaaagaccac 480 
aaccttgccg gtcagtgcgt ccgccattac gataccgcag gactcactca actcaacagc 540 
cttgccctga ccggcgccgt tctatcacaa tctcaacaac tgcttaccga taaccaggat 600 
gccgactgga caggtgaaga ccagagcctc tggcaacaaa aactgagtag tgatgtctat 660 
atcacccaaa gtaacactga tgccaccggg gctttactga cccagaccga tgccaaaggc 720 
aacattcagc ggctggccta tgatgtggcc gggcagctaa aagggagttg gttaacactc 780 
aaaggtcagg cggaacaggt gattatcaaa tcgctaacct actccgccgc cgggcaaaaa 840 
ttacgtgaag agcacggtaa cgggattgtc actgaataca gctacgaacc ggaaacccaa 900 
cggcttatcg gcattaccac tcgccgtcca tcagacgcca aggtgttgca agacctacgc 960 

tatcaatatg acccagtagg caatgtcatt agtatccgta atgatgcgga agccactcgc 1020 

ttttggcgca atcagaaagt agccccggag aatagctata cctacgattc cctgtatcag 1080 

cttatcagcg ccaccgggcg cgagatggcc aatatcggtc agcaaagcaa ccaacttccc 1140 

tctccggcgc taccttctga taacaatacc tacaccaact atactcgcac ttatacttat 1200 

gaccgtggcg gcaatttgac gaaaattcag catagttcac cagccgcgca aaataactac 1260 

acgacggata taacggtttc aaatcgcagc aaccgcgcgg tactcagcac attgaccgca 1320 

gatccaactc aagtcgatgc cttatttgat gcgggaggcc atcaaaccag cttgttatcc 1380 

ggccaagttc taacttggac accgcgaggc gaattgaaac aagccaacaa tagcgcagga 1440 

aatgagtggt atcgctacga tagcaacggc atacgccagc taaaagtgaa tgaacaacaa 1500 

actcagaata tcccgcaaca acaaagggta acttatctac cggggctgga aatacgtaca 1560 

acccagaaca acgccacaac aacagaagag ttacacgtta tcacactcgg taaagccggc 1620 

cgcgcgcaag tccgagtatt gcattgggag agcggtaaac cagaagatat taataacaat 1680 

cagcttcgtt acagctacga taatcttatt ggctccagcc aacttcaatt agatagcgac 1740 

ggacaaatta tcagtgaaga agaatattat ccatttggtg gtacagcgct gtgggcggca 1800 

aggaatcaaa ccgaagccag ctataaaacc attcgttatt ctggtaaaga gcgggatgtt 1860 

accgggctgt attattatgg ctaccgttat taccaaccgt gggcgggcag atggttaagt 1920 

gcagacccgg caggaaccat tgatggactg aatttatatc gcatggtgag aaataacccg 1980 

gtgacgcaat ttgatgttca gggattatca ccggccaaca gaacagaaga agcgataata 204 0 

aaacagggtt cctttacggg aatggaagaa gctgtttata aaaaaatggc taaacctcaa 2100 

actttcaaac gccaaagagc tatcgctgcc caaacagagc aagaagccca tgaatcattg . 2160 

accaacaacc ctagtgtaga tattagccca attaaaaact acaccacaga tagctcacaa 2220 



WO 2004/002223 



PCTAJS2003/020082 



104 



attaatgccg 


cgataaggga 


aaatcgtatt 


acgccagcag 


tggaaagttt 


agacgccaca 


2280 


ttatcttccc 


tacaagatag 


acaaatgagg 


gtaacttatc 


gggtgatgac 


ctatgtagat 


2340 


aattccacgc 


catcgccttg 


gcactcgcca 


caggaaggaa 


atagtattaa 


tgttggtgat 


2400 


atcgtttcgg 


ataacgctta 


tttatcaaca 


tcggcccatc 


gtggttttct 


gaattttgtt 


2460 


cacaaaaaag 


aaaccagtga 


aactcgatac 


gtcaagatgg 


catttttaac 


gaatgcgggt 


2520 


gtcaatgtct 


cagcagcatc 


tatgtataat 


aatgctggcg 


aggagcaagt 


atttaaaatg 


2580 


gatttaaacg 


attcaagaaa 


aagccttgct 


gaaaaattaa 


aactaagagt 


cagtggacca 


2640 


caatcgggac 


aagcggaaat 


attactacct 


agggaaacac 


agttcgaagt 


tgtttcaatg 


2700 


aaacatcaag 


gcagagatac 


ctatgtatta 


ttgcaagata 


ttaaccaatc 


cgcagccact 


2760 


catagaaatg 


tacgtaacac 


ttacaccggt 


aatttcaaat 


catccagtgc 


aaattaa 


2817 



<210> 49 

<211> 2538 

<212> PRT 

<213> Xenorhabdus nematophilus 

<400> 49 

Met Tyr Ser Thr Ala Val Leu Leu Asn Lys lie Ser Pro Thr Arg Asp 
1 5 10 15 

Gly Gin Thr Met Thr Leu Ala Asp Leu Gin Tyr Leu Ser Phe Ser Glu 
20 25 30 

Leu Arg Lys lie Phe Asp Asp Gin Leu Ser Trp Gly Glu Ala Arg His 
35 40 45 

Leu Tyr His Glu Thr He Glu Gin Lys Lys Asn Asn Arg Leu Leu Glu 
50 55 60 

Ala Arg He Phe Thr Arg Ala Asn Pro Gin Leu Ser Gly Ala He Arg 
65 70 75 80 

Leu Gly He Glu Arg Asp Ser Val Ser Arg Ser Tyr Asp Glu Met Phe 
85 90 95 

Gly Ala Arg Ser Ser Ser Phe Val Lys Pro Gly Ser Val Ala Ser Met 
100 105 HO 



Phe Ser Pro Ala Gly Tyr Leu Thr Glu Leu Tyr Arg Glu Ala Lys Asp 
115 120 125 
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Leu His Phe Ser Ser Ser Ala Tyr His lieu Asp Asn Arg Arg Pro Asp 
130 135 140 



Leu Ala Asp Leu Thr Leu Ser Gin Ser Asn Met Asp Thr Glu He Ser 
145 150 155 leo 

Thr Leu Thr Leu Ser Asn Glu Leu Leu Leu Glu His He Thr Arg Lys 
165 170 175 

Thr Gly Gly Asp Ser Asp Ala Leu Met Glu Ser Leu Ser Thr Tyr Arq 
ISO 185 190 

Gin Ala He Asp Thr Pro Tyr His Gin Pro Tyr Glu Thr He Arg Gin 
155 200 205 

Val He Met Thr His Asp Ser Thr Leu Ser Ala Leu Ser Arg Asn Pro 
210 215 220 



Glu Val Met Gly Gin Ala Glu Gly Ala Ser Leu Leu Ala He Leu Ala 
225 230 235 240 

Asn He Ser Pro Glu Leu Tyr Asn lie Leu Thr Glu Glu He Thr Glu 
245 250 255 

Lys Asn Ala Asp Ala Leu Phe Ala Gin Asn Phe Ser Glu Asn He Thr 
260 265 270 

Pro Glu Asn Phe Ala Ser Gin Ser Trp He Ala Lys Tyr Tyr Gly Leu 
275 280 285 



Glu Leu Ser Glu Val Gin Lys Tyr Leu Gly Met Leu Gin Asn Gly Tyr 
290 295 300 



Ser Asp Ser Thr Ser Ala Tyr Val Asp Asn He Ser Thr Gly Leu Val 
305 310 315 320 

Val Asn Asn Glu Ser Lys Leu Glu Ala Tyr Lys He Thr Arg Val Lys 
3 25 330 335 



Thr Asp Asp Tyr Asp Lys Asn He Asn Tyr Phe Asp Leu Met Tyr Glu 
340 345 350 



Gly Asn Asn Gin Phe Phe He Arg Ala Asn Phe Lys Val Ser Arg Glu 
355 36Q 365 



Phe Gly Ala Thr Leu Arg Lys Asn Ala Gly Pro Ser Gly He Val Gly 
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370 



375 



380 



Ser Leu Ser Gly Pro Leu lie Ala Asn Thr Asn Phe Lys Ser Asn Tyr 
385 390 395 400 



Leu Ser Asn lie Ser Asp Ser Glu Tyr Lys Asn Gly Val Lys lie Tyr 
405 410 415 



Ala Tyr Arg Tyr Thr Ser Ser Thr Ser Ala Thr Asn Gin Gly Gly Gly 
420 425 130 



lie Phe Thr Phe Glu Ser Tyr Pro Leu Thr lie Phe Ala Leu Lys Leu 
435 440 445 



Asn Lys Ala lie Arg Leu Cys Leu Thr Ser Gly Leu Ser Pro Asn Glu 
450 455 460 



Leu Gin Thr lie Val Arg Ser Asp Asn Ala Gin Gly lie lie Asn Asp 
465 470 475 480 



Ser Val Leu Thr Lys Val Phe Tyr Thr Leu Phe Tyr Ser His Arg Tyr 
485 490 495 



Ala Leu Ser Phe Asp Asp Ala Gin Val Leu Asn Gly Ser Val lie Asn 
500 505 510 



Gin Tyr Ala Asp Asp Asp Ser Val Ser His Phe Asn Arg Leu Phe Asn 
515 520 525 



Thr Pro Pro Leu Lys Gly Lys lie Phe Glu Ala Asp Gly Asn Thr Val 
530 535 540 



Ser lie Asp Pro Asp Glu Glu Gin Ser Thr Phe Ala Arg Ser Ala Leu 
545 550 555 560 



Met Arg Gly Leu Gly Val Asn Ser Gly Glu Leu Tyr Gin Leu Gly Lys 
565 570 575 



Leu Ala Gly Val Leu Asp Ala Gin Asn Thr He Thr Leu Ser Val Phe 
580 585 590 



Val He Ser Ser Leu Tyr Arg Leu Thr Leu Leu Ala Arg Val His Gin 
595 600 605 



Leu Thr Val Asn Glu Leu Cys Met Leu Tyr Gly Leu Ser Pro Phe Asn 
610 615 620 . 
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Gly Lys Thr Thr Ala Ser Leu Ser Ser Gly Glu Leu Pro Arg Leu Val 
625 «0 635 640 

He Trp Leu Tyr Gin Val Thr Gin Trp Leu Thr Glu Ala Glu He Thr 
645 650 

Thr Glu Ala He Trp Leu Leu Cys Thr Pro Glu Phe Ser Gly Asn lie 
660 665 670 

Ser Pro Glu He Ser Asn Leu Leu Asn Asn Leu Arg Pro Ser He Ser 
675 680 685 



Glu Asp Met Ala Gin Ser His Asn Arg Glu Leu Gin Ala Glu He Leu 
690 695 



700 



Ala Pro Phe He Ala Ala Thr Leu His Leu Ala Ser Pro Asp Met Ala 
7 °5 710 tic 



715 



720 



Arg Tyr He Leu Leu Trp Thr Asp Asn Leu Arg Pro Gly Gly Leu Asp 
72 5 730 735 

He Ala Gly Phe Met Thr Leu Val Leu Lys Glu Ser Leu Asn Ala Asn 
740 745 750 

Glu Thr Thr Gin Leu Val Gin Phe Cys His Val Met Ala Gin Leu Ser 
755 760 765 

Leu Ser Val Gin Thr Leu Arg Leu Ser Glu Ala Glu Leu Ser Val Leu 
770 775 780 

Val He Ser Gly Phe Ala Val Leu Gly Ala Lys Asn Gin Pro Ala Gly 
785 790 795 800 

Gin His Asn He Asp Thr Leu Phe Ser Leu Tyr Arg Phe His Gin Trp 
805 810 815 



He Asn Gly Leu Gly Asn Pro Gly Ser Asp Thr Leu Asp Met Leu Arg 
820 — *" ' ~ v 3 



825 



830 



Gin Gin Thr Leu Thr Ala Asp Arg Leu Ala Ser Val Met Gly Leu Asp 



840 



845 



lie Ser Met Val Thr Gin Ala Met Val Ser Ala Gly Val Asn Gin Leu 
850 855 



860 
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Gin Cys Trp Gin Asp He Asn Thr Val Leu Gin Trp He Asp Val Ala 
865 870 875 880 



Ser Ala Leu His Thr Met Pro Ser Val He Arg Thr Leu Val Asn He 
885 890 895 



Arg Tyr Val Thr Ala Leu Asn Lys Ala Glu Ser Asn Leu Pro Ser Trp 
900 905 910 



Asp Glu Trp Gin Thr Leu Ala Glu Asn Met Glu Ala Gly Leu Ser Thr 
915 920 925 



Gin Gin Ala Gin Thr Leu Ala Asp Tyr Thr Ala Glu Arg Leu Ser Ser 
930 935 940 



Val Leu Cys Asn Trp Phe Leu Ala Asn He Gin Pro Glu Gly Val Ser 
945 950 955 960 



Leu His Ser Arg Asp Asp Leu Tyr Ser Tyr Phe Leu He Asp Asn Gin 
965 970 " 975 



Val Ser Ser Ala He Lys Thr Thr Arg Leu Ala Glu Ala lie Ala Gly 
980 985 990 



He Gin Leu Tyr He Asn Arg Ala Leu Asn Arg He Glu Pro Asn Ala 
995 1000 1005 



Arg Ala Asp Val Ser Thr Arg Gin Phe Phe Thr Asp Trp Thr Val 
1010 1015 1020 



Asn Asn Arg Tyr Ser Thr Trp Gly Gly Val Ser Arg Leu Val Tyr 
1025 1030 1035 



Tyr Pro Glu Asn Tyr He Asp Pro Thr Gin Arg He Gly Gin Thr 
1040 1045 1050 



Arg Met Met Asp Glu Leu Leu Glu Asn He Ser Gin Ser Lys Leu 
1055 1060 1065 



Ser Arg Asp Thr Val Glu Asp Ala Phe Lys Thr Tyr Leu Thr Arg 
1070 1075 1080 



Phe Glu Thr Val Ala Asp Leu Lys Val Val Ser Ala Tyr His Asp 
1085 1090 1095 
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Asn Val Asn Ser Asn Thr Gly Leu Thr Trp Phe Val Gly Gin Thr 
H00 1105 mo 

Arg Glu Asn Leu Pro Glu Tyr Tyr Trp Arg Asn Val Asp He Ser 
1115 1120 H25 

Arg Met Gin Ala Gly Glu Leu Ala Ala Asn Ala Trp Lys Glu Trp 
1130 1135 H40 

Thr Lys He Asp Thr Ala Val Asn Pro Tyr Lys Asp Ala He Arg 
1145 H50 ii 5 5 

Pro Val He Phe Arg Glu Arg Leu His Leu He Trp Val Glu Lys 
1160 ' H65 H70 

Glu Glu Val Ala Lys Asn Gly Thr Asp Pro Val Glu Thr Tyr Asp 
1175 H80 lies 

Arg Phe Thr Leu Lys Leu Ala Phe Leu Arg His Asp Gly Ser Trp 
1190 H95 1200 

Ser Ala Pro Trp Ser Tyr Asp He Thr Thr Gin Val Glu Ala Val 
1205 1210 1215 

Thr Asp Lys Lys Pro Asp Thr Glu Arg Leu Ala Leu Ala Ala Ser 
1220 1225 1230 

Gly Phe Gin Gly Glu Asp Thr Leu Leu Val Phe Val Tyr Lys Thr 
1235 1240 1245 

Gly Lys Ser Tyr Ser Asp Phe Gly Gly Ser Asn Lys Asn Val Ala 
1250 1255 1260 

Gly Met Thr He Tyr Gly Asp Gly Ser Phe Lys Lys Met Glu Asn 
1265 1270 1275 

Thr Ala Leu Ser Arg Tyr Ser Gin Leu Lys Asn Thr Phe Asp He 
1280 1285 " 1290 

He His Thr Gin Gly Asn Asp Leu Val Arg Lys Ala Ser Tyr Ara 
1295 1300 i 3 05 * 

Phe Ala Gin Asp Phe Glu Val Pro Ala Ser Leu Asn Met Gly Ser 
1310 1315 1320 



Ala He Gly Asp Asp Ser Leu Thr Val Met Glu Asn Gly Asn He 
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1325 1330 1335 



Pro Gin He Thr Ser Lys Tyr Ser Ser Asp Asn Leu Ala He Thr 
1340 1345 1350 



Leu His Asn Ala Ala Phe Thr Val Arg Tyr Asp Gly Ser Gly Asn 
1355 1360 1365 



Val He Arg Asn Lys Gin He Ser Ala Met Lys Leu Thr Gly Val 
1370 1375 1380 



Asp Gly Lys Ser Gin Tyr Gly Asn Ala Phe He He Ala Asn Thr 
1385 1390 1395 



Val Lys His Tyr Gly Gly Tyr Ser Asp Leu Gly Gly Pro He Thr 
1400 1405 1410 



Val Tyr Asn Lys Thr Lys Asn Tyr He Ala Ser Val Gin Gly His 
1415 1420 1425 



Leu Met Asn Ala Asp Tyr Thr Arg Arg Leu He Leu Thr Pro Val 
1430 1435 1440 



Glu Asn Asn Tyr Tyr Ala Arg Leu Phe Glu Phe Pro Phe Ser Pro 
1445 1450 1455 



Asn Thr He Leu Asn Thr Val Phe Thr Val Gly Ser Asn Lys Thr 
1460 1465 1470 



Ser Asp Phe Lys Lys Cys Ser Tyr Ala Val Asp Gly Asn Asn Ser 
1475 1480 1485 



Gin Gly Phe Gin He Phe Ser Ser Tyr Gin Ser Ser Gly Trp Leu 
1490 1495 1500 



Asp He Asp Thr Gly He Asn Asn Thr Asp He Lys He Thr Val 
1505 1510 1515 



Met Ala Gly Ser Lys Thr His Thr Phe Thr Ala Ser Asp His He 
1520 1525 1530 



Ala Ser Leu Pro Ala Asn Ser Phe Asp Ala Met Pro Tyr Thr Phe 
1535 1540 1545 



Lys Pro Leu Glu He Asp Ala Ser Ser Leu Ala Phe Thr Asn Asn 
1550 1555 1560 
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He Ala Pro Leu Asp He Val Phe Glu Thr Lys Ala Lys Asp Gly 
1565 1570 ' * ' 



1575 



Arg Val Leu Gly Lys He Lys Gin Thr Leu Ser Val Lys Arq Val 
1580 1585 1590 

Asn Tyr Asn Pro Glu Asp He Leu Phe Leu Arg Glu Thr His Ser 
1595 1600 160 5 

Gly Ala Gin Tyr Met Gin Leu Gly Val Tyr Arg He Arg Leu Asn 
1610 1615 1620 

Thr Leu Leu Ala Ser Gin Leu Val Ser Arg Ala Asn Thr Gly lie 
1625 1630 163s 

Asp Thr He Leu Thr Met Glu Thr Gin Arg Leu Pro Glu Pro Pro 
1540 1645 1650 

Leu Gly Glu Gly Phe Phe Ala Asn Phe Val Leu Pro Lys Tyr Asd 
1655 166 0 1S65 

Pr ° ™ G1U H±S Gly ASP G1U ^9 Tr P Phe L V S He His He Gly 
1670 1675 1680 

Asn Val Gly Gly Asn Thr Gly Arg Gin Pro Tyr Tyr Ser Gly Met 
1685 1690 



1695 



Leu Ser Asp Thr Ser Glu Thr Ser Met Thr Leu Phe Val Pro Tyr 
1700 1705 1?10 

Ala Glu Gly Tyr Tyr Met His Glu Gly Val Arg Leu Gly Val Gly 
1715 1720 1725 

^ ™ LyS 116 ^ Asp Asn Thr Trp Glu Ser Ala Phe Phe 

1730 1735 1?40 

Tyr Phe Asp Glu Thr Lys Gin Gin Phe Val Leu He Asn Asp Ala 
1/4:> 1750 i7 5 5 

*^ ™ ASP Ser Gly Met Thr Gln Gln G1 Y n e Val Lys Asn He 
1760 i 7 65 177Q 

Lys Lys Tyr Lys Gly Phe Leu Asn Val Ser He Ala Thr Gly Tyr 
1775 i7 80 1785 * r 
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Ser Ala Pro Met Asp Phe Asn Ser Ala Ser Ala Leu Tyr Tyr Trp 
1790 1795 1800 



Glu Leu Phe Tyr Tyr Thr Pro Met Met Cys Phe Gin Arg Leu Leu 
• 1805 1810 1815 



Gin Glu Lys Gin Phe Asp Glu Ala Thr Gin Trp lie Asn Tyr Val 
1820 1825 1830 



Tyr Asn Pro Ala Gly Tyr He Val Asn Gly Glu He Ala Pro Trp 
1835 1840 1845 



lie Trp Asn Cys Arg Pro Leu Glu Glu Thr Thr Ser Trp Asn Ala 
1850 1855 1860 



Asn Pro Leu Asp Ala He Asp Pro Asp Ala Val Ala Gin Asn Asp 
1865 1870 1875 



Pro Met His Tyr Lys He Ala Thr Phe Met Arg Leu Leu Asp Gin 
1880 1885 1890 



Leu He Leu Arg Gly Asp Met Ala Tyr Arg Glu Leu Thr Arg Asp 
1895 1900 1905 



Ala Leu Asn Glu Ala Lys Met Trp Tyr Val Arg Thr Leu Glu Leu 
1910 1915 1920 



Leu Gly Asp Glu Pro Glu Asp Tyr Gly Ser Gin Gin Trp Ala Ala 
1925 1930 ' 1935 



Pro Ser Leu Ser Gly Ala Ala Ser Gin Thr Val Gin Ala Ala Tyr 
1940 1945 1950 



Gin Gin Asp Leu Thr Met Leu Gly Arg Gly Gly Val Ser Lys Asn 
1955 I960 1965 



Leu Arg Thr Ala Asn Ser Leu Val Gly Leu Phe Leu Pro Glu Tyr 
1970 1975 * 1980 



Asn Pro. Ala Leu Thr Asp Tyr Trp Gin Thr Leu Arg Leu Arg Leu 
1985 1990 1995 



Phe Asn Leu Arg His Asn Leu Ser He Asp Gly Gin Pro Leu Ser 
2000 2005 2010 
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Leu Ala lie Tyr Ala Glu Pro Thr Asp Pro Lys Ala Leu Leu Thr 
2015 2020 2025 

Ser Met Val Gin Ala Ser Gin Gly Gly Ser Ala Val Leu Pro Gly 
2030 2035 2040 

Thr Leu Ser Leu Tyr Arg Phe Pro Val Met Leu Glu Arg Thr Arg 
2045 2050 2055 

Asn Leu Val Ala Gin Leu Thr Gin Phe Gly Thr Ser Leu Leu Ser 
2060 2065 2070 

Met Ala Glu His Asp Asp Ala Asp Glu Leu Thr Thr Leu Leu Leu 
2075 2080 2085 

Gin Gin Gly Met Glu Leu Ala Thr Gin Ser lie Arg He Gin Gin 
2090 2095 2100 



Arg Thr Val Asp Glu Val Asp Ala Asp He Ala Val Leu Ala Glu 
2105 2110 2115 



Ser Arg Arg Ser Ala Gin Asn Arg Leu Glu Lys Tyr Gin Gin Leu 
2120 2125 2130 



Tyr Asp Glu Asp He Asn His Gly Glu Gin Arg Ala Met Ser Leu 
2 135 2140 2145 



Leu Asp Ala Ala Ala Gly Gin Ser Leu Ala Gly Gin Val Leu Ser 
2150 2155 2160 

He Ala Glu Gly Val Ala Asp Leu Val Pro Asn Val Phe Gly Leu 
2165 2170 2175 

Ala Cys Gly Gly Ser Arg Trp Gly Ala Ala Leu Arg Ala Ser Ala 
2180 2185 2190 

Ser Val Met Ser Leu Ser Ala Thr Ala Ser Gin Tyr Ser Ala Asp 
2195 2200 2205 



Lys He Ser Arg Ser Glu Ala Tyr Arg Arg Arg Arg Gin Glu Trp 
2210 2215 2220 

Glu He Gin Arg Asp Asn Ala Asp Gly Glu Val Lys Gin Met Asp 
2225 2230 2 235 



Ala Gin Leu Glu Ser Leu Lys He Arg Arg Glu Ala Ala Gin Met 
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2240 2245 2250 



Gin Val Glu Tyr Gin Glu Thr Gin Gin Ala His Thr Gin Ala Gin 
2255 2260 2265 



Leu Glu Leu Leu Gin Arg Lys Phe Thr Asn Lys Ala Leu Tyr Ser 
2270 2275 2280 



Trp Met Arg Gly Lys Leu Ser Ala lie Tyr Tyr Gin Phe Phe Asp 
2285 2290 2295 



Leu Thr Gin Ser Phe Cys Leu Met Ala Gin Glu Ala Leu Arg Arg 
2300 2305 . 2310 



Glu Leu Thr Asp Asn Gly Val Thr Phe lie Arg Gly Gly Ala Trp 
2315 2320 2325 



Asn Gly Thr Thr Ala Gly Leu Met Ala Gly Glu Thr Leu Leu Leu 
2330 2335 2340 



Asn Leu Ala Glu Met Glu Lys Val Trp Leu Glu Arg Asp Glu Arg 
2345 2350 2355 



Ala Leu Glu Val Thr Arg Thr Val Ser Leu Ala Gin Phe Tyr Gin 
2360 2365 2370 



Ala Leu Ser Ser Asp Asn Phe Asn Leu Thr Glu Lys Leu Thr Gin 
2375 2380 2385 



Phe Leu Arg Glu Gly Lys Gly Asn Val Gly Ala Ser Gly Asn Glu 
2390 2395 2400 



Leu Lys Leu Ser Asn Arg Gin lie Glu Ala Ser Val Arg Leu Ser 
2405 2410 2415 



Asp Leu Lys lie Phe Ser Asp Tyr Pro Glu Ser Leu Gly Asn Thr 
2420 2425 2430 



Arg Gin Leu Lys Gin Val Ser Val Thr Leu Pro Ala Leu Val Gly 
2435 2440 2445 



Pro Tyr Glu Asp lie Arg Ala Val Leu Asn Tyr Gly Gly Ser lie 
2450 2455 2460 



Val Met Pro Arg Gly Cys Ser Ala He Ala Leu Ser His Gly Val 
2465 2470 2475 
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Asn Asp Ser Gly Gin Phe Met Leu Asp Phe Asn Asp Ser Arq Tyr 
2480 2485 2490 

Leu Pro Phe Glu Gly He Ser Val Asn Asp Ser Gly Ser Leu Thr 
2495 2500 2505 



Leu Ser Phe Pro Asp Ala Thr Asp Arg Gin Lys Ala Leu Leu Glu 
2510 2515 2520 

Ser Leu Ser Asp He He Leu His He Arg Tyr Thr He Arg Ser 
2525 2530 2535 



